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Injecting new life 
into anold vaccine 


The BCG vaccine is nearing its centenary. A 
new delivery method could protect more 
people from one of the world’s biggest killers. 


uberculosis is a leading cause of death world- 

wide and the biggest killer among people 

with HIV. Despite widespread efforts to boost 

vaccination and treatment, each year around 

10 million people become ill from the disease, 
and in 2018 some 1.5 million people died, according tothe 
World Health Organization (WHO). Astonishingly, the only 
licensed tuberculosis vaccine — BCG (Bacillus Calmette— 
Guérin), made from a weakened strain of the disease-caus- 
ing bacterium, Mycobacterium tuberculosis — has been 
in use for almost a century. Although BCG ably protects 
infants from systemic infection, it does less well against 
the lung disease that tuberculosis causes in adolescents 
and adults. 

Newer anti-tuberculosis medicines have been devel- 
oped, and more drugs and vaccines are in clinical trials. 
Nonetheless, the two most powerful drugs currently in 
use are not effective against multidrug-resistant strains 
of M. tuberculosis. 

Anew study in non-human primates (P. A. Darrah et al. 
Nature 577, 95-102; 2020) could breathe new life into the 
BCG vaccine. The key advance it reports is that delivery of 
the vaccine intravenously — directly into the bloodstream — 
rather than into skin or muscle, boosted protection against 
tuberculosis, particularly in the lungs. The translation of 
those findings into clinical practice is still some way off, 
but the pay-off could be huge. 

Inthe study, researchers in the United States compared 
the results when BCG was delivered in three different ways 
to separate groups of macaque monkeys (Macaca mulatta): 
intravenously; by injection into the skin; or through an 
aerosol spray. The team found that intravenous adminis- 
tration yielded more T cells — atype of immune cell — that 
responded to M. tuberculosis than did either of the other 
routes. Moreover, these T cells were found throughout 
the animals’ lungs. 

Six months later, some of the macaques were exposed 
to M. tuberculosis. Of those who received the vaccine intra- 
venously, nine out of ten animals were highly protected 
against disease and six showed no detectable signs of 
infection. By contrast, most of the macaques vaccinated 
by other routes exhibited hallmarks of infection. 

The results were a surprise: it was thought that intra- 
venous delivery of the vaccine might limit the severity 
of disease, but not prevent infection altogether. Instead, 
intravenous BCG delivery seems to trigger the establish- 
ment ofa population of T cells in the lungs that can rapidly 
neutralize M. tuberculosis. 
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The risks of 
injecting 
alive 
bacterium 
into the 
bloodstream 
need to 

be better 
understood.” 


Ifthese results hold up in subsequent studies, this would 
represent significant promise for tuberculosis control. A 
cheap and already widely used vaccine could confer greater 
resistance to disease with only a relatively small change to 
the way in whichit is administered. But there are anumber 
of hurdles to overcome. 

The first is safety. As with most vaccines, BCG contains 
live bacteria that have been weakened, making them unable 
toinfect people with robust immune systems. But the risks 
of injecting alive bacterium directly into the bloodstream 
— as opposed to injecting it into the skin— need to be better 
understood. If, for some reason, a vaccine became contam- 
inated by other bacteria or contained M. tuberculosis capa- 
ble of causing disease, an infection could quickly become 
life-threatening. Researchers must complete careful and 
exhaustive safety tests before carrying the technique over 
into clinical trials in humans. 


Further obstacles 


If the new approach is found to be safe, testing intrave- 
nous administration of the vaccine and then delivering it 
in people is likely to present further obstacles. It is more 
difficult to administer a vaccine intravenously than to give 
a quick jab into skin or muscle — and doing so will require 
extra funds to train medical personnel in countries where 
underfunded public-health systems already struggle to 
provide adequate primary health care. 

This lack of funding, and the fact that tuberculosis 
mostly affects people in poorer countries, is one reason 
why drug companies have not developed new treatments. 
However, newer players — particularly not-for-profit 
research groups — are changing drug discovery. One lead- 
ing vaccine candidate, M72/ASO1,, developed through a 
collaboration between a non-profit organization anda 
pharmaceutical company, protected adults with latent 
M. tuberculosis infection from going on to develop active 
tuberculosis for at least three years (D. R. Tait et. al. N. Engl. 
J.Med. 381, 2429-2439; 2019). And last summer, the US Food 
and Drug Administration approved a drug — also developed 
through not-for-profit research — as part of acombination 
totreat an extreme form of multidrug-resistant tuberculo- 
sis (Nature http://doi.org/dhvs; 2019). In all, the WHO says 
that there are 23 drugs, several combination therapies and 
14 vaccine candidates in clinical trials. 

Intravenous vaccines are being developed in other areas, 
too. One malaria vaccine called PfSPZ is due to be tested 
ina large clinical trial in 2,100 people this year in Bioko, 
Equatorial Guinea (Nature http://doi.org/gfzjq2; 2019). 

Existing tuberculosis treatments saved between 53 mil- 
lion and 64 million lives between 2000 and 2018. But it 
is unacceptable that so many people still fall ill with the 
disease. Under the United Nations Sustainable Develop- 
ment Goals, tuberculosis deaths need to be reduced by 
90% by 2030, and BCG vaccination will bea crucial part of 
this effort. Universal vaccination in infants has been rec- 
ommended in more than 150 countries — including many 
in Africa and southeast Asia, which have been particularly 
hard hit by the disease. A potential means of boosting a 
vaccine’s power warrants the world’s attention. 
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KATE FLANAGAN 


A personal take on science and society 


World view 


By Heidi Rehm 


New Year's resolution: one 
meeting-free week a quarter 


Protect time for deep thinking — it is crucial 
for productivity in a world of constant 
interaction, urges Heidi Rehm. 


ast December, I analysed how much time I’d 

devoted to meetings and conference calls during 

the previous three months. Discounting the days 

when I had been travelling or at conferences and 

offsite meetings, I had averaged 8.5 hours a day. 
This year, I havea plan to protect work time: I’m scheduling 
one week a quarter during which meetings and conference 
calls are not allowed in my research group. So far, eight 
other groups have joined in this plan, and we expect oth- 
ers to come on board. More than 500 people expressed 
support for my call on Twitter. 

I split my time between academic and clinical work, dis- 
covering the causes of rare disease, implementing genom- 
ics in health care, and building tools to support genomic 
medicine. I usually start work at 6 a.m., which allows me 
2 hours of thoughtful work time: that includes following 
up one-mails that demand special attention, and preparing 
for lectures and meetings, which typically begin at 8 a.m. 1 
attend to Slack and e-mail communications inthe evening. 
Writing and editing, preparing new presentations, and 
other more-substantive projects are for weekends and 
flights. That leaves little time for major projects, writing 
grants or doing deep critical or innovative thinking. This 
is what I want meeting-free weeks for. 

Istumbled across the idea by accident. Several years ago, 
my holiday plans fell through, but I left the block on my 
calendar, preventing any meetings from being scheduled. 
When the week arrived, I confessed that I planned to spend 
the time at home with my children, who were on holiday 
from school. But with my younger son at basketball camp 
and my older daughter determined to be independent, I 
found myself free to dive into work with little disruption. 
I caught up on the long list of tasks that had piled up, par- 
ticularly writing and editing. I got my e-mail inbox down 
to zero by the end of the week, and I also had time to watch 
films with my family in the evenings. It was magnificent! 

I began doing this at least one week a year. Members of 
my lab quickly worked out what was going on, sol stopped 
pretending and made it clearer which weeks were actual 
holidays (when I really am mostly off the grid), and when 
it was ‘Heidi’s working at home without meetings’ week. 
They respected my plan because they knew that I would 
get the things done that they needed from me. 

Other groups at the Broad Institute in Cambridge, 
Massachusetts, including the Genomics Platform, the 
Data Sciences Platform and the lab of Daniel MacArthur, 
began implementing one meeting-free week per quarter. 
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Block out 
the weeks on 
the calendar 
immediately.” 


Heidi Rehm 

is the chief 
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Broad Institute’s 
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It has been extremely positive, with people reporting 
being highly productive and able to dive into larger, some- 
times daunting, projects that require longer stretches of 
dedicated attention. However, because more than 
three-quarters of my meetings and calls are outside the 
institute, those weeks still had at least 30 hours scheduled. 

So, late last year, I extended the call to the broader 
genomics community. | crafted a proposal that avoids 
clashes with major genomics meetings and alternates 
which week of the month is free, to avoid repeated con- 
flicts with monthly meetings (go.nature.com/2en531ly). 
Several groups have since told me that they are working to 
implement the practice in their own programmes and labs. 

My suggestion is to block out the weeks on the calendar 
immediately. Next, start cancelling and rescheduling meet- 
ings booked for those weeks. Standing weekly meetings 
can almost certainly be shed, as can many monthly ones, 
which will be disrupted only once a year. It is OK to keep 
very difficult-to-schedule meetings. Once the week arrives, 
ad hoc meetings are allowed, at the discretion of the group 
head, but these should have the specific purpose of accom- 
plishing the work planned for the meeting-free week. 

Although] have not found studies comparing approaches 
to curbing meetings, several surveys suggest that we are 
overly burdened withthem. A survey of 182 senior managers 
across a range of industries found that about two-thirds 
felt that meetings cut into deep thinking and kept them 
from completing their own work (L. A. Perlow etal. Harvard 
Business Rev. 62-69; July-August 2017). 

Of course, four weeks per year is a paltry amount of time 
for people whose job it is, essentially, to be innovative. 
Researchers, research groups and work places need to 
brainstorm their own systems to prevent meetings from 
degrading productivity, rigour and innovation. Some peo- 
ple carve out one day per week without meetings, others a 
period of time each day. Some companies arrange company- 
wide protected time. I encourage every person and pro- 
gramme tofind asystem that works for them and stick toit. 

In addition to taking time out from meetings, notice 
other barriers to productivity, perhaps e-mail or social 
media. Create periods of time that are free from these 
distractions or give them up entirely. Take a sabbatical if 
achange of environment is needed. 

Personal strategies aside, with today’s extensive, 
globe-spanning collaborations, it makes sense to develop 
an international, coordinated effort across programmes 
andinstitutions, and find four weeks a year to align meeting- 
free time globally. 

lencourage everyone to enjoy these quarterly fasts. Let’s 
report back to one another at the end of the year. Perhaps 
it will help to bring the research community together more 
deeply. 
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The world this week 


Newsin brief 


IRAN TAKES PIVOTAL 
NUCLEAR STEP 


Iran has announced that it 
will no longer abide by any 
restrictions on its uranium- 
enrichment programme, 
prompting fears that the 2015 
deal to limit the country’s 
nuclear activities is effectively 
dead. Experts say that if Iran 
resumes enrichment at full 
throttle, it could be capable 
of creating anuclear bombin 
one year. 

The step is widely seen as 
aresponse to the 3 January 
US drone strike on Baghdad that 
killed visiting Iranian military 
commander Qasem Soleimani, 
the architect of Iran’s military 
operations around the world. 

Iran struck the 2015 nuclear 
deal, the Joint Comprehensive 
Plan of Action (JCPOA), with six 
global powers on the condition 
that punishing international 
sanctions would be lifted. It 
agreed to drastic cuts inits 
ability to produce enriched 
uranium and plutonium, 
which can be used to builda 
nuclear bomb. In particular, 
the deal limited the number 
of centrifuges Iran could use 
to separate uranium isotopes, 
and the amount and purity of 
uranium-235 the country could 
stockpile. 

The United States unilaterally 
pulled out of the JCPOA in 
2018 and resumed harsh 
economic sanctions against 
the country. In May 2019, Iran 
announced partial removal of 
its restrictions. On 5 January, 
foreign minister Mohammad 
Javad Zarif tweeted that there 
will be no more limits on the 
numbers of centrifuges. He 
added that Iran will continue 
to allow international nuclear 
inspections and that it will revert 
to full compliance with the deal 
ifthe other parties do the same. 
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SCI-HUB PROBED 
FORALLEGED 
LINKS TORUSSIAN 
INTELLIGENCE 


The operator of Sci-Hub, a 
website that provides people 
worldwide with illicit access to 
alarge volume of copyright- 
protected academic literature, 
is reportedly being investigated 
by the US Department of Justice 
(DOJ) on suspicion of stealing 
US military secrets. 

The agency suspects 
Kazakhstan-born Alexandra 
Elbakyan, who created Sci-Hub 
in 2011, of running the pirate 
site with silent approval from 
the Russian government and, 
possibly, with support from 
military intelligence. The 
Washington Post reported the 
allegations on19 December, 
citing anonymous sources close 
tothe DOJ. The DOJ told Nature 
that it could neither confirm 
nor deny the existence of an 
ongoing investigation. 

Sci-Hub allegedly operates 
from servers in Russia. Critics 
say that Elbakyan, acomputer 
scientist whose whereabouts 
are unknown, uses advanced 
hacking and phishing practices 
to acquire login details illegally 
from libraries and personal 
subscribers. Elbakyan previously 
told Nature that she runs Sci-Hub 
alone as a private enterprise, 
with financial help from 
anonymous donors. She did not 
respond to Nature’s request for 
comment on her alleged links to 
Russian intelligence. 
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UK GOVERNMENT 
CALL FOR SCIENCE 
“WEIRDOS' PROMPTS 
CAUTION 


Researchers have reacted with 
surprise to a bizarrely worded 
job advertisement posted bya 
senior UK government adviser. 
The notice calls for scientists, 
mathematicians and “super- 
talented weirdos” to work for 
the prime minister, and cites 
several scientific papers. 

Dominic Cummings 
(pictured), a political strategist 
who is chief special adviser to 
Prime Minister Boris Johnson, 
posted the advertisement on his 
personal website on 2 January. 
Cummings, known for his strong 
and sometimes controversial 
views on science, says that he is 
seeking “data scientists, project 
managers, policy experts, 
assorted weirdos’, as well as 
“unusual” mathematicians, 
physicists and economists, to 
work in the prime minister’s 
office in Downing Street. The 
post attracted attention for 
its brash language and odd 
requirements, suchas that 
applicants show excitement 
about a slew of specified 
scientific ideas. 

Researchers, including the 
authors of some of the cited 
papers, welcomed a focus 
on data-driven techniques 
and scientific skills from the 
top level of government, 
but cautioned against 
oversimplifying how science is 
applied to policymaking. 
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Bush fires 
‘twice the size 
of Belgium’ 
rage across 
Australia 


Sign up to get essential science 
news, opinion and analysis 
delivered to your inbox daily. 
Visit go.nature.com/newsletter 


Nearly half a billion native animals are estimated 

to have perished in bush fires in New South Wales, 
the Australian state most affected by the ongoing 
disastrous fires, and where this image was captured. 

The total number of animals killed is likely to be much 
higher across the country, where conflagrations have 
been burning in multiple states since September. 

The fires have so far burnt through at least six million 
hectares of land — more than twice the size of Belgium 
— across five states. In three states alone — New South 
Wales, Victoria and South Australia — fires have claimed 
at least 24 human lives and destroyed more than 
1,800 homes. 

The scale of the disaster prompted the federal 
government to call onthe country’s defence forces to 
help fire-affected communities. 

On 3 January, residents of coastal towns in New South 
Wales and Victoria were told to leave ahead of worsening 
conditions, leading to one of the country’s largest mass 
evacuations on record. About 160 fires were burning 
across those states as Nature went to press. 


RUMOURED CHANGES 
TOUS GOVERNMENT 
OPEN-ACCESS POLICY 


Arumour that the White 

House is considering a policy 
that would make all federally 
funded studies free to read on 
publication has fuelled a debate 
between scientists who favour 
open access and publishers of 
subscription journals. 

According to the widely 
discussed rumour, whose origin 
is unclear, the administration 
of President Donald Trump is 
drafting an executive order 
that would force the change in 
publishing practices. 

On 18 December, two groups 
that represent publishers — 
the Association of American 
Publishers in Washington DC 
and the International 
Association of Scientific, 
Technical, and Medical 
Publishers in Oxford, UK — sent 
letters to the US government 
opposing any such policy. It 
would hinder the peer-review 
process, stifle innovation and 
tip the publishing business into 
chaos, they wrote. 

But several scientists who 
advocate open access countered 
the publishers’ arguments. “I 
welcome the rumored policy,” 
tweeted John Wilbanks, chief 
commons officer at the non- 
profit research organization 
Sage Bionetworks in Seattle, 
Washington. “I work at a *really 
well funded* non-traditional 
research organization. We still 
can’t afford journal access 
subscriptions.” 

Kristina Baum, a 
spokesperson for the White 
House Office of Science and 
Technology Policy, declined to 
comment onthe rumour. 

The rumour follows an effort 
led by European funders, called 
Plan §, that will require that 
research they fund be made 
open access on publication, with 
liberal licensing terms. 


/ 
HEAD OF ANCIENT- 
DNALAB SACKED 
FOR MISCONDUCT 


The University of Adelaide has 
fired Alan Cooper, the high- 
profile leader of its Australian 
Centre for Ancient DNA, for 
“serious misconduct”. His 
dismissal follows allegations 
that Cooper bullied staff and 
students, and an investigation 
into the ‘culture’ of the centre. 

Cooper is a major figure in 
the field of ancient DNA, and 
has charted the migrations of 
prehistoric people and their 
domestic animals around the 
globe. 

Ina statement, the university 
said it would not comment on 
the circumstances that led to 
Cooper’s dismissal, but said 
that the move did not relate to 
the quality or integrity of his 
research or any other research 
conducted at the centre. 

“The university takes 
seriously its commitment to the 
welfare of students and staff,” it 
also said. 

In response to his dismissal, 
Cooper told Nature that he 
rejects the allegation that he was 
a bully. “I work at the highest 
international levels, and want 
my students and staff to do the 
same. I’ve occasionally been 
too blunt in my language and 
actions, and regret this — but 
it was never bullying,” he says. 
Cooper says that he was never 
warned about his conduct 
before action was taken against 
him, and that he was never asked 
to attend management courses. 
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The world this week 


News in focus 
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The devastating fire at Notre-Dame de Paris last April destroyed the cathedral’s roof and spire. 


Ne 


THE HUGE SCIENTIFIC 
EFFORT TOSTUDY 
NOTRE-DAME'S ASHES 


Last year’s fire at Paris’s beloved cathedral shocked the world. Now, researchers 
are making use of the unprecedented opportunity to study its materials. 


By Philip Ball 


he fire that destroyed large sections 
of the iconic cathedral Notre-Dame 
de Paris last April was a national 
tragedy. Now, months on, scientists 
with the French national research 
organization CNRS are embarking ona multi- 
million-euro effort to study the 850-year-old 
building and its materials with the goal of 
illuminating how it was constructed. With 
unprecedented access to the cathedral’s 
fabric — including timber, metalwork and 
the building’s foundations — in the wake of 
the fire, scientists also hope that their work 


will arm them with information to help the 
restoration. 

The research could “write a new page in 
the history of Notre-Dame, because there 
are currently many grey areas,” says Yves 
Gallet, a historian of Gothic architecture at 
the University of Bordeaux-Montaigne, who 
is in charge of a 30-strong research team 
investigating the masonry. 

Construction of the cathedral, considered 
one of the finest examples of the French 
Gothic style, began in the twelfth century. 
The structure was modified in the Middle 
Ages and extensively restored in the nine- 
teenth century. But it has been the subject of 
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surprisingly little scientific research compared 
with other Gothic monuments in France 
and elsewhere, says Martine Regert, a bio- 
molecular archaeologist at the CNRS’s CEPAM 
centre for the study of historical cultures 
and environments in Nice, who is one of the 
project’s leaders. Many questions remain 
about the structure, such as which sections 
are medieval and whether architect Eugéne 
Viollet-Le-Duc, who supervised the nine- 
teenth-century renovation, reused some of 
the older materials, says Regert. 

The fire on 15 April, possibly caused by an 
electrical fault, destroyed the cathedral’s 
roof and spire, and caused part of its vaulted 
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ceiling to collapse. The walls still stand, and the 
building will eventually be restored — although 
this is likely to take longer than the ambitious 
five years initially forecast, and is set to cost 
hundreds of millions of euros. 

But, until then, the interior of the building 
holds piles of debris: fallen stonework, 
burnt timbers and damaged metal artefacts, 
all now available for scientific study. The 
absence of tourists might also make it pos- 
sible to use radar to probe the foundations, 
which have been little investigated before. 
Even some parts of the structure that were 
largely undamaged are now more accessi- 
ble for inspection, says metals specialist 
Philippe Dillmann of the CNRS Laboratory for 
Archaeomaterials and Alteration Forecast- 
ing in Gif-sur-Yvette, who is coordinating the 
project with Regert. 


Architectural investigations 


The CNRS project will focus on seven topics: 
masonry, wood, metalwork, glass, acoustics, 
digital data collection and anthropology. Inall, 
the effort will involve more than 100 research- 
ers in 25 laboratories and will last for 6 years. 

Gallet’s team will study Notre-Dame’s 
stones to identify the quarries that supplied 
them and “reconstruct the supply networks 
and the economy of the site”. Studying the 
mortar used to bind the stones together could 
reveal how different compositions were used 
for the various structural elements — vault- 
ing, walls and flying buttresses. A better 
knowledge of the historical materials could 
inform choices made in restoration, says 
Gallet. 

The team will also analyse weaknesses in 
the remaining structure caused by the high 
temperatures of the fire, and by the fall of 
masonry and the water used to extinguish the 
flames. A radar study will determine how solid 
the foundations are before restorers erect 
scaffolding to allow them to dismantle the 
unstable remnants of the nineteenth-century 
spire. 

Meanwhile, ateam of about 50 will focus on 
Notre-Dame’s famous woodwork — especially 
the timbers inthe roof space above the vaults, 
dubbed the forest — which has either burnt 
away or lies charred in the nave. These black- 
ened remains could be tremendously valuable 
to researchers. 

“The burnt structure constitutes a gigantic 
laboratory for archaeology,’ says dendrologist 
Alexa Dufraisse of the National Museum of 
Natural History in Paris, who will lead the 
multidisciplinary wood team. 

“Wood is an extraordinary source of 
information,’ says Regert. Initial observations 
have confirmed that the ‘forest’ is made of oak, 
but studies will pinpoint the exact species 
used and give researchers clues about the 
techniques and tools of medieval timber 
construction. 
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Tree-ring dating of timber beams could 
reveal the year and location in which the 
trees were felled, filling in gaps in knowledge 
about the sequence of construction. “Each tree 
records within its tissues the environment in 
which it has grown,” says Dufraisse. This kind 
of study “could never have been conducted 
without the destruction of the structure by 
fire”, she says. 

In particular, says Regert, the wood is a 
climate archive. “Isotopic analyses of oxygen 


“The burnt structure 
constitutes a gigantic 
laboratory for 
archaeology.” 


and carbon in the rings make it possible to 
determine the temperature and rainfall over 
time,” she says. The trees used in Notre-Dame 
grew between the eleventh and thirteenth 
centuries, during a warm period known as 
the medieval climate optimum, offering a 
reference period for natural climate warm- 
ing to compare with anthropogenic warming 
today. 


Access all areas? 

Collecting and excavating the materials for 
analysis is challenging. There are three main 
piles of debris — inthe nave, the northtransept 


and the crossing between the two — as well as 
material still on top of the remaining vaults. 
But these are currently off-limits to people 
for safety reasons, Dillmann says — so robots 
and drones must do all the collecting. Some 
of this material might ultimately be reused in 
restoration. 

The collection and analysis will need to be 
documented precisely and thoroughly. Livio 
de Luca, a specialist in digital mapping of 
architecture at the CNRS’s Mixed Research 
Unit in Marseille will lead a team dedicated 
to creating a “digital ecosystem” that sum- 
marizes both the scientific research and the 
current and previous states of the cathedral, 
drawing onthe work of scientists, historians, 
archaeologists, engineers and curators — and 
perhaps even on old tourist photos of the 
structure. 

“It will be like a ‘digital twin’ of the cathedral, 
able to evolve as the studies progress,’ de Luca 
says. It will include online models for 3D 
visualization of the building andits attributes, 
created from billions of data points, with 
the history and evolution of the structure 
superimposed on the spatial map. 

As well as deepening researchers’ 
understanding of this monumental building, 
Regert hopes that the scientific studies will 
be useful when its ravaged vaults rise again. 
The results, she says, might “illuminate the 
choices that society will have to make for 
the restoration”. 


WHAT CRISPR-BABY 
PRISON SENTENCES 
MEAN FOR RESEARCH 


Chinese court sends strong signal by punishing 


He Jiankui and two colleagues. 


By David Cyranoski 


Chinese court has sentenced 

He Jiankui, the biophysicist who 

announced that he had created the 

world’s first gene-edited babies, to 

three years in prison for “illegal 

medical practice”, and handed down shorter 

sentences to two colleagues who assisted him. 

The punishments put to rest speculation over 

whether the Chinese government would bring 

criminal charges for an act that shocked the 

world, and they are likely to deter others from 
similar behaviour, say Chinese scientists. 

There has been much speculation about 

whether other scientists would follow in 

He‘s footsteps, especially given the relative 
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ease of using the most popular gene-editing 
tool, CRISPR-Cas9. But the punishments 
are “definitely a deterrent to similar mis- 
conduct in China”, says Wei Wensheng, a 
gene-editing researcher at Peking University 
in Beijing. 

On 30 December, a court in Shenzhen 
announced that He and two colleagues had 
flouted regulations and research ethics by 
altering genes in human embryos that were 
then implanted into two women, according 
to Xinhua News Agency. One woman gave 
birth to twin girls in late 2018; the court said 
a third baby has been born but did not say 
when, a revelation that fits with He’s claim in 
November 2018 that asecond woman had been 
implanted with a gene-edited embryo. 
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The court fined He 3 million yuan 
(US$430,000). Collaborators Zhang Renliand 
QinJinzhou received less severe punishments. 

The health ministry has also banned the 
researchers from working with human repro- 
ductive technology again, and the science 
ministry has banned them from applying for 
research funding, according to Xinhua. 

Scientists in China who are currently 
researching into CRISPR’s potential to treat 
various genetic diseases by modifying cells 
other than those of embryos say that they fear 
He’s actions might have a detrimental effect on 
their work, too, eventhough itis not as ethically 
contentious. He shocked the world’s scientists 
in November 2018 when he announced that his 
team at Southern University of Science and 
Technology in Shenzhen had used CRISPR to 
edit DNA in human embryos to make them less 
susceptible to HIV. The edits were designed to 
disrupt a gene that encodes a protein allowing 
HIV to enter immune cells. 

Scientists condemned He’s actions, saying 
that gene-editing technology was too prema- 
ture to be used for reproductive purposes. 
They also said the experiment was problem- 
atic because it risked introducing a mutation 
with potentially harmful effects while offering 
little benefit — the babies were not at high risk 
of contracting HIV. In the wake of the scandal, 
researchers called for a moratorium on gene 
editing inembryos and germline cells. 

At the time, Chinese law academics told 
Nature that He could face a range of criminal 
charges, including practising medicine with- 
out adequate qualifications, forging ethics 
documents and skirting laws banning the 
use of assisted reproductive technologies in 
people with HIV. He was fired by his university 
inJanuary last year. 

The court’s announcement puts to rest 
the suspicions of some researchers that the 


government would not bring a criminal case 
against He because of the increased media 
attention it would generate, says Tang Li, a 
science-policy researcher at Fudan University 
in Shanghai. Discussion of He’s experiments 
was widely censored on Chinese social media. 
But Tang says the disclosure of the court’s 
result demonstrates China’s commitment on 
research ethics. This is a big step forward in 
promoting responsible research, she says. 

Although an unpublished manuscript 
describing the experiments lists ten authors, 
according to MIT Technology Review, He, Zhang 
and Qin are the only ones to face penalties so 
far. The manuscript says Zhang “performed 
the human embryo microinjections”, M/T 
Technology Review reports. Zhang, who was 
affiliated with the Guangdong Academy of 
Medical Sciences and Guangdong General 
Hospital in Guangzhou at the time of the 
experiments, has been sentenced to two 
years in prison and fined one million yuan. 
Qin, an embryologist at Southern University 
of Science and Technology who was named 
as the applicant on the experiment listed on 
China’s clinical-trial website, was given an 
18-month suspended prison sentence and 
fined 500,000 yuan, according to Xinhua. Wei 
says it is unlikely that He will work again asa 
researcher at a Chinese institution. 

The trio’s punishments send a powerful 
message to other researchers doing any type 
of gene-editing work in clinical trials in China, 
says Lu You, an oncologist at Sichuan Univer- 
sity in Chengdu. His team was the first to test 
CRISPR gene-editing in humans, ina clinical 
trial that modified adult — not embryonic — 
cells taken from people with lung cancer (see 
page 156). “If | was a newcomer, a researcher 
wishing to start gene-editing research and 
clinical trials, the case would be enough to 
alert meto the cost of such violations,” he says. 
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Wanda 
Diaz Merced 


Astronomy is inextricably associated with 
colourful images of the cosmos. But Wanda 
Diaz Merced says that neglecting senses 
other than sight can mean missing out on 
discoveries. Diaz Merced, an astronomer 
at the International Astronomical Union 
(IAU) Office for Astronomy Outreach in 
Mitaka, Japan, is a pioneer of sonification, 

a technique that converts aspects of data, 
such as frequency, into audible elements. 
It could help astronomers to avoid biases 
from interpreting data only visually, argues 
Diaz Merced, who lost her sight in her 
twenties. At an [AU meeting on equality 
and diversity in Mitaka last December, 

Diaz Merced spoke to Nature about the 
approach. 


How did you begin working on sonification? 
As an intern at NASA in 2005, | created a 
tool to map astronomical data into sound 
— pitch, rhythm and volume. In my 2013 
PhD dissertation, | proved that it is useful. 


How did you do that? 

| presented users with spectral data and 
asked them to look for a double peak that 
indicates a black hole. People tried to 
identify signals using vision only, audio 
only and by combining vision and sound. 
We found that combining audio with visual 
interaction was the most sensitive way. 


How can sonification be used? 

It can help us to study the habitability 

of an exoplanet, by understanding how 
high-energy rays interact with its magnetic 
field or atmosphere. Such interactions cause 
fluctuations of electromagnetic emission 
from the star that vary with frequency. 


Are astronomers using this approach? 
People are making a bit more of an effort. 
My collaborators and | are analysing and 
developing sonification software designed 
around the user's experience. Now we have 
to provide the field with the evidence to 
help researchers to change their mindset. 


Interview by Elizabeth Gibney 
This interview has been edited for length 
and clarity. 
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News in focus 


UEST TOUSE CRISPR 
ENE EDITING TO FIGHT 
DISEASE GAINS GROUND 


As clinical-trial results trickle in, scientists look ahead 
to more-sophisticated medical applications. 


By Heidi Ledford 


he prospect of using the popular 
genome-editing tool CRISPR to treat 
a host of diseases in people is moving 
closer to reality. 

Medical applications of the 
CRISPR-Cas9 system had a banner year in 
2019. The first results trickled in from trials 
testing the tool in people, and newtrials were 
launched. In the coming years, researchers 
are looking forward to more-sophisticated 
applications of CRISPR genome editing that 
could lay the foundation for treating an array 
of diseases, from blood disorders to hereditary 
blindness. 

But although the results of clinical trials 
of CRISPR genome editing so far have been 
promising, researchers say that it is still too 
soon to know whether the technique will be 
safe or effective in the clinic. “There’s beena 
lot of appropriate caution in applying this to 
treating people,” says Edward Stadtmauer, an 
oncologist at the University of Pennsylvaniain 
Philadelphia. “But I think we’re starting to see 
some of the results of that work.” 

It has been only seven years since 
researchers discovered that CRISPR-Cas9, a 
molecular defence system that microorgan- 
isms use to fend off viruses and other invaders, 
could be used to rewrite human genes. Since 
then, gene editing has attracted attention for 
its potential to modify embryos — an applica- 
tion that is ethically and legally fraught ifthose 
embryos are destined to become humans 
(see page 154). But, in parallel, scientists have 
been testing CRISPR’s much less controversial 
ability to disable or correct problematic genes 
in other cells to treat a host of diseases. 

In 2016, Chinese researchers announced 
that they had treated the first person with 
a CRISPR-Cas9 therapy designed to fight 
cancer. Incells extracted fromthe participant’s 
blood, the researchers disabled the gene that 
codes for a protein called PD-1, which holds 
the immune system in check but can shield 
cancer cells inthe process. The scientists then 
reinjected the cells. 

By late 2019, the US government’s 
clinicaltrials.gov database listed more thana 
dozen active studies that are testing CRISPR- 
Cas9 as a treatment for a range of conditions, 
including cancer, HIV and blood disorders. 
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Sickle-cell anaemia is marked by misshapen 
red blood cells. 


“There's been alot of 
appropriate caution 
in applying thisto 
treating people.” 


So far, too few people have been treated 
in these trials for any firm conclusions to be 
drawn about the safety of CRISPR-Cas9 thera- 
pies or how well they work. Preliminary results 
from two trials — one in which gene-edited 
blood cells were transplanted into a man to 
treat HIV infection, and the other in which 
they were transplanted into three people to 
treat cancer — showed no signs of clinical 
improvement. 

In both cases, the transplanted cells 
flourished inthe recipients’ bone marrow, with- 
out any serious safety concerns, but did not 
produce aclear medical benefit. In the HIV trial, 
the researchers attempted to use CRISPR to dis- 
ablea protein that many strains of the virus use 
to enter cells. But only 5% of the transplanted 
cells were edited — not enough to cure disease, 
the researchers said last September (L. Xuetal. 
N.Engl.J. Med. 381, 1240-1247; 2019). The study 
is on hold while researchers explore ways to 
boost that percentage, says Hongkui Deng, a 
stem-cell researcher at Peking University in 
Beijing and alead author of the work. 

There are early hints that another trial might 
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meet with more success. CRISPR Therapeutics 
in Cambridge, Massachusetts, and Vertex Phar- 
maceuticals in Boston, Massachusetts, treated 
two people who have the genetic disorders 
sickle-cell anaemia and B-thalassaemia, in which 
oxygen-carrying haemoglobin moleculesin the 
blood are depleted. The idea is to use CRISPR 
to disable a gene that otherwise shuts down 
production of another form of haemoglobin. 
Early results suggest that the treatment might 
have eased some symptoms of the disorders. 

Other researchers are itching to move 
beyond editing cells in a dish. The challenge 
is in finding ways to transport the gene-editing 
machinery to where it is needed in the body, 
says John Leonard, chief executive of Intellia 
Therapeutics, a biotechnology company in 
Cambridge, Massachusetts, that is focused 
on CRISPR-Cas9 genome editing. 

Last July, the pharmaceutical companies 
Editas Medicine in Cambridge, Massachu- 
setts, and Allergan in Dublin launched atrial to 
treat Leber congenital amaurosis 10, agenetic 
disorder that can cause blindness, by editing 
eye cells. Researchers will inject the eye witha 
virus containing DNA that encodes the CRISPR 
genome-editing machinery, bypassing the 
need to guide those tools through the blood- 
stream tothe specific tissues. The virus will be 
responsible for carrying the genome-editing 
tools into cells. It is the first trial to attempt 
CRISPR-Cas9 gene editing inside the body, 
and early results could be reported this year. 

That would be a landmark moment for the 
field, says Charles Gersbach, a bioengineer at 
Duke University in Durham, North Carolina. 
But he and others say that they hope research- 
ers will eventually move away from using 
viruses to shuttle genome-editing machinery 
into cells. Deactivated viruses can provoke 
immune responses, and carry only a limited 
amount of DNA. 

What’s more, some gene-editing tools 
are too large to fit inside commonly used 
gene-therapy viruses, says Andrew Anzalone, 
a chemical biologist at the Broad Institute of 
MIT and Harvard in Cambridge, Massachu- 
setts. These include the souped-up CRISPR 
systems called prime editors that were first 
reported last year (A. V. Anzalone etal. Nature 
576, 149-157; 2019). 

Intellia is looking for a way around using 
the viruses. The company has partnered 
with Swiss pharmaceutical giant Novartis to 
develop fatty nanoparticles that can protect 
genome-editing molecules as they travel 
through the bloodstream, but can also pass 
through the membranes of target cells. 

None of the technologies currently being 
tested is what researchers foresee for the 
long-term applications of genome editing, 
says Gersbach. “The approaches that people 
are taking are the things that we can do today,” 
he says, “but not what we would doif we could 
design the ideal drug.” 


EYE OF SCIENCE/SPL 


Feature 


In some animals, three blade-like proteins 
together form the Piezo1 channel, which 
helps their cells to sense touch. 


FEEL THE FORCE 


After decades of puzzling over how cells sense touch and pressure, scientists are 


zooming in on the proteins responsible. By Amber Dance 


he girl tried hard to hold her arms 
and hands steady, but her fingers 
wriggled and writhed. If she closed 
her eyes, the squirming got worse. It 
wasn’t that she lacked the strengthto 
keep her limbs still — she just didn’t 

seem to have control over them. 
Carsten Bonnemann remembers 
examining the teenager at ahospitalin Calgary, 
Canada, in 2013. As a paediatric neurologist 
with the US National Institute of Neurological 
Disorders and Stroke in Bethesda, Maryland, 
he often travelled to weigh in on puzzling 
cases. But he had never seen anything like this. 
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If she wasn’t looking at her limbs, the girl 
didn’t seem to have any clue where they were. 
She lacked the sense of her body’s position 
in space, a crucial ability known as proprio- 
ception. “This is something that just doesn’t 
occur,” says Bonnemann. 

His team sequenced the girl’s genes, and 
those of another girl with similar symptoms’, 
and found mutations ina gene called P/EZO2. 
Their timing was fortunate: just a few years 
earlier, researchers looking for the mechanisms 
that cells use to sense touch had found that the 
gene encoded a pressure-sensitive protein’. 

The discovery of Piezo2 and arelated protein, 
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Piezol, was a high point in a decades-long 
search for the mechanisms that control the 
sense of touch. The Piezos are ion channels — 
gates in the cell membrane that allow ions to 
pass through — that are sensitive to tension. 
“We've learned a lot about how cells com- 
municate, and it’s almost always been about 
chemical signalling,” says Ardem Patapoutian, 
amolecular neurobiologist at Scripps Research 
in LaJolla, California, whose group identified 
the Piezos. “What we’re realizing now is that 
mechanical sensation, this physical force, is 
also a signalling mechanism, and very little is 
known about it.” 


DAVID S. GOODSELL/RCSB PDB 


Touch underlies the functioning of almost 
every tissue and cell type, says Patapoutian. 
Organisms interpret forces to understand 
their world, to enjoy a caress and to avoid 
painful stimuli. In the body, cells sense blood 
flowing past, air inflating the lungs and the 
fullness of the stomach or bladder. Hearing 
is based on cells in the inner ear detecting the 
force of sound waves. 

Over the past decade, the study of Piezos 
and other mechanosensitive ion channels has 
exploded. More than 300 papers have been 
published on the Piezos alone over the past 
three years. One of the biggest questions is 
how the proteins, situated in the cell mem- 
brane, sense and respond to force. Using 
cryo-electron microscopy (cryo-EM), scien- 
tists have made progress in unravelling the 
Piezo channels’ bizarre, three-bladed struc- 
ture, but a complete mechanism has been 
elusive. Researchers are also finding roles for 
Piezos beyond touch or proprioception. For 
example, Piezos might help to explain why cer- 
tain people are resistant to malaria, and per- 
haps even why astronauts lose bone density 
while in orbit. Already, researchers are begin- 
ning to think about targeting force-sensing 
proteins with medicines to treat, for example, 
chronic pain. 

“For along time, we knewthat cells did this, 
we had no idea how,’ says Miriam Goodman, a 
sensory physiologist at Stanford University in 
California. “Piezo really changed that.” 


Touch and go 


Touch has long been a slippery sense. Other 
senses, suchas sight or taste, are better under- 
stood, says Patapoutian: photons hitting the 
eye or chemicals infiltrating the nose and 
tongueall activate receptors inthesame family. 
These receptors trigger ion channels to open 
and allow positive ions in. That depolarizes the 
cell, converting the stimulus into an electrical 
signal that the brain can decode. 

Scientists suspected that in touch, proprio- 
ception and hearing, one proteinacts as both 
the force sensor and the ion channel, because 
in hearing, the signalling happens fast — in 
microseconds. But the identity of these uni- 
fied sensor-channel proteins remained mostly 
mysterious — at least in mammals. Researchers 
had found some mechanosensitive channels 
in bacteria, fruit flies and nematode worms. 

So Patapoutian and his colleague Bertrand 
Coste hatched a plan. They would start witha 
type of mouse cell that they knew was capable 
of transforming a tiny poke froma pipette into 
a measurable electrical current. Then, Coste 
would knock out candidate ion-channel genes, 
a different one in each batch of cells, and look 
for a batch that suddenly lost its touch sensi- 
tivity. Coste started off confident, thinking it 
would take a few months, maybe even weeks, 
to finda hit. 

Ittook the better part ofa year. Then, shortly 


PRESSURE SENSORS 


Cells have specialized proteins that help them to 
sense force. Piezo proteins are some of the best 
characterized in mammals, and they form channels 
that ferry ions into and out of cells. Researchers are 
trying to determine how the channels open. 


Closure close-up 

Images of closed channels reveal that they are built 
from three Piezo proteins, each contributing a blade 
and beam arranged around a central pore. The channel 
puckers the membrane in which it sits, forming a divot. 


Piezo protein Cap 
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Opening gambits 

There are no pictures of open Piezo channels, but 
researchers have a few theories about how they might 
unlock. The curved blades could flatten to pop open 
the pore, or a force could move the blades, somehow 
acting on the beams to open the channel. 
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The blades might rotate and twist the pore open. 
Some work suggests that each blade can move 
independently. 
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before the end of 2009, he sawsomething — or, 
rather, nothing. Coste poked with his pipette, 
and the cell did not respond. He must have 
eliminated a force-sensitive channel. 

“It was a very nice day,’ recalls Coste, now at 
the French national research agency (CNRS) 
in Marseille. He and Patapoutian named the 
mouse gene Piezo1, from the Greek word for 
pressure, and soon identified Piezo2. Later, the 
team linked Piezo2 directly to touch sensation 
inthesensory neurons and skin cells of mice’. 


Hidden blades 


Researchers were abuzz with the result, 
Goodman recalls, particularly because the 
Piezo proteins were so large and complex. 
Made of more than 2,500 amino acids and 
weighing a hefty 300 kilodaltons, Piezol’s 
structure crosses the cell membrane a 
record-breaking 38 times. (For comparison, 
mammalian proteins typically contain closer 
to 500 amino acids.) 

Unfortunately, that gargantuan size 
hindered researchers trying to answer the 
hottest questions inthe Piezo field: how dothe 
channels sense force? And how do they open 
and close? A protein structure is a big help 
with these kinds of question, says biophysi- 
cist Roderick MacKinnon at the Rockefeller 
University in New York City. “We don’t know 
until we see it.” 

Structural techniques such as X-ray crys- 
tallography and nuclear magnetic resonance 
spectroscopy struggle to cope with big, com- 
plex proteins, says neuroscientist Bailong Xiao 
at Tsinghua University in Beijing, a former 
postdoctoral fellow in Patapoutian’s lab. 

Fortunately, as Xiao was setting up his lab in 
2013, another option for obtaining high-reso- 
lution structures was coming online: cryo-EM. 
His group used the method to report’ the first 
structure of Piezol in 2015, and since then, sev- 
eral higher-resolution versions have followed 
from Xiao’s group, MacKinnon’s, and Patapou- 
tian’s. Last September, Xiao followed up with 
a picture of Piezo2, which is similar to Piezol 
in size and shape. Xiao’s picture of Piezo2 was 
the clearest view yet of the ends of the three 
blades, which move around and so are hard 
to capture’. 

The images were striking. Three Piezo pro- 
teins come together inatrimer that straddles 
the plasma membrane (see ‘Pressure sensors’). 
From the central pore, three arms spiral out 
like the blades ona propeller. They curve up 
and out, creating a deep divot in the surface 
of the cell. 

Patapoutian and Xiao think that when a 
force hits the membrane, the blades move 
protein ‘beams’ on the interior of the channel, 
which somehow drag the pore open. To Mack- 
innon, the unusual way in which the Piezos’ 
blades pucker the membrane suggests a differ- 
ent mechanism: ifa push or pullincreases the 
tension of the membrane, the curved channel 


Nature | Vol577 | 9 January 2020 | 159 


Feature 


might flatten out, opening the pore. 

The hypotheses can’t be tested yet, because 
researchers have been able to study only 
isolated Piezo proteins, separate from their 
membrane and in the closed conformation. 
Snapping a picture of a Piezo ina membrane 
and openshould help scientists to understand 
its secrets. “We want tosee it in its natural envi- 
ronment,” says Patapoutian. 

Several labs are trying to image an open 
Piezo. Patapoutian’s group is using a 
Piezol-activating compound that it named 
Yodal after the diminutive, green, force-wield- 
ing Jedi master in Star Wars. Patapoutian hopes 
that with Yodal present, Piezol might open up 
fora picture. He is also interested in plugging 
Piezo proteins into artificial membranes called 
nanodiscs, which might help to stabilize the 
open conformation. Xiao, meanwhile, is work- 
ing with cryo-electron tomography, which 
involves imaging the sample at different tilt 
angles, and might help to clarify the structure 
ina native or artificial membrane. 


Sore spot 


In parallel with the structural studies, scientists 
are finding that Piezo proteins have diverse 
roles in the body. 

In 2014, neuroscientist Alex Chesler had just 
joined the National Center for Complementary 
and Integrative Health in Bethesda. Inspired by 
Coste’s discovery, he was creating mice lack- 
ing Piezo2 to investigate the channel’s role in 
touch. Then, one day, he received an e-mail 
from BOnnemann, who worked in his building, 
about the girls who lacked proprioception. 

Chesler ran straight upstairs to Bonne- 
mann’s office. Panting, he announced, “You 
have no idea what you have.” Chesler couldn't 
directly ask mice lacking Piezo2 what they felt, 
or rather didn’t feel — but he could ask people. 

He and B6nnemann invited the girls to 
Bethesda to evaluate their condition more 
extensively. Both girls could compensate for 
their disability remarkably well, using vision 
to help them walk in a line or touch a target. 
But blindfolded, they struggled. Similarly, 
they could sense the vibration of a tuning fork 
against their skin because they could hear it. 
While wearing noise-cancelling headphones, 
they didn’t notice the vibration at all’. 

Patapoutian found the same phenomenon 
in mice: without Piezo2 in nerves that supply 
muscles and tendons, they lacked proprio- 
ception and were uncoordinated®. His team 
also found a role for Piezo2 in pain-sensing 
neurons in allodynia, a specific type of pain 
sensation in which even a gentle caress feels 
like being pricked with needles. Some people 
with neuropathic pain experience this hyper- 
sensitivity all the time. 

Mice normally show allodynia when injected 
with capsaicin — the spicy molecule found in 
hot chilli peppers — or after nerve injury, but 
not if they were missing the Piezo2 gene’. 
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Chesler and BOnnemann reported similar 
changes to pain perception among people 
with PIEZO2 mutations’. 

“Chronic pain has sucha debilitating effect,” 
says Swetha Murthy, who led one of the allo- 
dynia studies while a postdoc with Patapou- 
tian. “I think we can start looking at drug 
targets for Piezo2 for these neuropathies.” 
Both Patapoutian and Chesler are on the hunt 
for compounds that would block Piezo2 activ- 
ity at a pain site, without interfering with the 
protein’s other roles throughout the body. 
“There is huge potential for Piezo-channel 
drug discovery,” says Xiao. 

It’s not only neurons that need to sense 
touch; almost every cell is subject to some kind 
of force. Take red blood cells, which deform to 
squeeze through tiny capillaries. Mutations 
that hyperactivate Piezol cause blood cells to 
shrivel, and that can provoke anaemia in peo- 
ple witha rare condition known as dehydrated 
hereditary stomatocytosis. 

Those scrunched-up blood cells reminded 
Patapoutian of sickle-cell anaemia. The 


“Thereis huge potential 
for Piezo-channel drug 
discovery.’ 


sickle-cell gene mutation has persisted in 
many people of African descent because it 
protects against malaria, and Patapoutian 
wondered whether PIEZO1 mutations might 
do the same. 

Ifso, there should bea relatively high rate of 
such mutations in people of African descent. 
Database searches revealed that Patapoutian 
was right: in fact, one particular PIEZO1 variant 
appeared in one-third of people in the data- 
base with African ancestry’. A separate team 
has reported that carriers of this PIEZOI muta- 
tion are resistant to severe malaria”. 

Piezol also has a job in bone formation 
and maintenance, according to work from 
Xiao’s lab. When his team knocked out Piezo 
in mouse osteoblasts — bone-making cells — 
the animals grew up shorter and skinnier than 
normal. The long bones that support body 
weight were lighter, thinner and weaker than 
in control mice. 

Moreover, wild-type mice that are partially 
suspended in the air — so they don’t have to 
support their full body weight — have lower 
levels of Piezol expression and bone mass". 
It’sa phenomenon much like what happens to 
people with osteoporosis, those who are bed- 
ridden and astronauts and cosmonauts aboard 
the International Space Station, says Xiao. 


Pressure points 


“The discovery of Piezos was a huge step 
forward for the whole field,” says Kate Poole, a 
biologist at the University of New South Wales 
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in Sydney, Australia, but “it is also clear that the 
story is not just Piezos.” 

Scientists interested in hearing have been 
chasing the relevant channel for four decades. 
“There have been a lot of false leads along the 
way,” says Jeffrey Holt, a neuroscientist at 
Boston Children’s Hospital in Massachusetts. 
“Now we think we've got a pretty solid handle 
onit.” 

The key channel protein is called TMC1. 
When Holt altered amino acids in TMCI1, the 
procedure changed the ability of inner-ear 
cells to translate mechanical signals into 
electrical ones”. Another report showed that 
purified TMC1 is able to create a mechano- 
sensitive ion channel in artificial membra- 
nous bubbles”. The structure of TMC1is still 
a mystery, however, because the protein has 
been hard to purify in sufficient quantities to 
get good cryo-EM images. 

Patapoutian’s team, meanwhile, is looking 
for entirely new channel families. In 2018, 
he, Murthy and Scripps structural biologist 
Andrew Ward reported what they think could 
be the largest group of mechanically activated 
channels. They knew of a protein family that 
helps plants to sense osmotic pressure — the 
OSCA proteins — and reasoned that they might 
sense force more generally. In human kidney 
cells, OSCAs did indeed respond to Murthy’s 
stretching of the cell membrane”. 

The researchers also knew from previous 
studies that the OSCA proteins were closely 
related to another family of proteins in 
mammals, the TMEM63 proteins. TMEM63 
channels from mice, humans and even fruit 
flies responded to stretch in Murthy’s assays, 
too, so OSCA and TMEM63 proteins make up 
alarge family of force sensors that iscommon 
to many living things. 

The channels discovered so far cannot 
explain all instances of cellular mechano- 
sensitivity, says Murthy, now a biophysicist 
and neuroscientist at Oregon Health & Science 
University in Portland. More mechanosensors 
must be out there. 

And those sensors probably have more 
jobs than are known today, says Patapoutian. 
“We've just barely scraped the surface.” 


Amber Dance is a freelance writer in Los 
Angeles, California. 
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Cataglyphis ants use astonishing navigational techniques to find their way back to the nest after foraging. 


Wayfinding wonder: how 
an ant navigates the desert 


Asumptuous book reveals the location-sensing secrets 
of adriven insect. By William A. Foster 


he fear of getting lost and being unable 
to find our way homeis woven into the 
stories we hear as children: it can haunt 
us for years. Humanity’s navigational 
skills are poor and increasingly rarely 
used, leaving us to view feats of animal navi- 
gation witha mixture of envy and admiration. 
How do Atlantic salmon find their way back 
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to the streams where they were born, after 
up to three years at sea? How do Arctic terns 
find their breeding sites in the far north after 
excursions of more than 70,000 kilometres 
to the Antarctic? 

Desert Navigator is the story of how atiny 
ant (Cataglyphis spp.) became the ideal model 
organism for the study of animal navigation. It 


© 2020 Springer Nature Limited. All rights reserved. 


begins 50 years ago in a vast Saharan salt pan, 
where alone, shiny black ant caught the eye of 
neuroethologist Riidiger Wehner as it scuttled 
across the sand. Eventually, it discovered the 
corpse of a large fly, gripped it firmly in its 
mandibles, and then performed the manoeu- 
vre that launched Wehner’s field of research. 

The ant set off in a straight line, crossing 
more than 100 metres of the barren ground 
to disappear into an inconspicuous hole — the 
entrance to its underground colony. The only 
plausible explanation is that the ant knew all 
along exactly where it was in relation to its 
home nest. But how does Cataglyphis manage 
this, with a minute brain and no mobile phone? 

Wehner unspools the answer over the book’s 
seven chapters, describing the astonishing 
subtlety, intricacy and diversity of the tech- 
niques used both by the ants in finding their 
way home and by researchers in discovering 
how they doit. The ants plot their compass 


AGEFOTOSTOCK/ALAMY 


MICHEL ROGGO/NPL 


direction using patterns of polarized light and 
gradients of colour and light intensity in the 
sky, along with the position of the Sun, backed 
up by cues from Earth’s magnetic field and the 
wind direction. They know where they are by 
counting the steps they have taken and keep- 
ing track of the direction they were following 
at the time. They can memorize panoramic 
‘snapshots’ of landmarks, such as boulders, 
around their goals. Somehow, their brains 
integrate all this information so that their 
foraging journeys can be optimally organized. 

The central message is that the ant’s sophis- 
ticated repertoire of navigation behaviour 
actually emerges from a large number of rel- 
atively simple elements, which Wehner calls 
the ant’s toolkit. These are combined in dif- 
ferent ways, depending on the problem. This 
bottom-up, modular approach provides the 
key to understanding the ants’ navigational 
prowess and how it might have been honed 
during their development and evolutionary 
history. Wehner sees no need to introduce the 
further complication of a mental map, often 
invoked, for example, in studies on rodents: 
the ant can find its way around simply by 
cunning use of its toolkit’s components. 

The toolkit concept underpins the investi- 
gative protocols used by Wehner and his team. 
The basic approachis to change a stimulus with 
surgical precision. The researchers carefully 
measure the resulting mistake the ant might 
make when performing a particular task, com- 
pared with an unmanipulated control ant. The 
exact behavioural consequence of the change 
in that particular stimulus is now revealed, 
although the physiology of how this effect is 
producedis usually not investigated, basically 
because the ant’s brain is so minuscule. 

For example, to prove that the ants could 
measure distance travelled by counting the 
number and length of strides taken, the team 
added stilts to the insects’ legs. An ant that 
arrived at the food source and then was given 
aset of stilts took the correct number of strides 
back to the nest. But it overshotits destination 
tothe extent predicted by the increase in stride 
length. Even more astonishingly, the ants can 
allow for the extent to which they move up 
and down over bumps during a journey: they 
turn their 3D experience into an accurate 2D 


Desert Navigator: 

f The Journey of an Ant 
Rudiger Wehner 
Harvard University 
Press (2020) 


Desert Navigator 


THE JOURNEY OF AN ANT: 


”— Riidiger Wehner 


Atlantic salmon (Salmo salar) can return to spawning sites after years away. 


measurement. The researchers established 
this by forcing ants to take an outward forag- 
ing trip ona long ‘roller coaster’: a sawtooth 
track across the desert. Onthe returnjourney 


“The ant’s sophisticated 
navigation behaviour 
emerges froma large 
number of simple elements.” 


over flat ground, the ants predicted accurately 
where the nest should be. How they do this is 
not known. 

A hugely important aspect of this research 
is that the intricate experiments on the desert 
navigators are carried out under natural field 
conditions. This is unusual. Experimental 
studies on navigation in rats, for example, 
are routinely carried out in small-scale labo- 
ratory enclosures. Chapter 2, the longest in 
the book, is devoted to an absorbing account 
of the biology and ecology of the desert ants. 
This embodies a key lesson that Wehner would 
like to give to all students of animal navigation: 
“start out as a naturalist”, he says, and “let the 
animal guide your way of investigation”. 

My only reservation is that this is a 
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demanding book for its intended audi- 
ence — the general reader. Wehner explains 
everything clearly and concisely, but only 
once. There are more than 100 detailed figures 
of the results of individual experiments, some 
of which might involve up to 10 variables, such 
as which part ofan ant’s eye was covered; each 
figure requires several minutes of detailed 
concentration. A glossary, chapter summaries 
and alittle more time spent on explanation 
would have helped. Wehner’s prose is pre- 
cise and sometimes quite idiosyncratic, but 
his respectful admiration for these “elegant, 
skilled, and vivacious little runners” always 
shines through. 

This sumptuously produced book is a 
triumph both of natural history and of science, 
with lessons that reach well beyond the study 
of animal navigation. Understanding how this 
wonderful eusocial insect can accomplish its 
apparently miraculous feats of navigation 
has required imagination, intelligence and 
decades of disciplined application. Here, in 
one place, we can at last savour the full glory 
of this remarkable achievement. 


William A. Foster is emeritus curator of insects 


in the Department of Zoology at the University 
of Cambridge, UK. 
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Twisted timelines ofa 
double dystopia 


Anovel from science-fiction giant William Gibson makes 
us consider what future we wish for. By Liesbeth Venema 


ith a breezy “Here we go,” Eunice 

enters the room — and the 

third page of William Gibson’s 

speculative novel Agency. | 

instantly liked her. Resourceful, 
fast-talking and street-smart, Euniceis ona 
serious mission: to give the world a fair chance 
to fend off the end of humanity. But she doesn’t 
waste time on existential angst. And she has 
kindness to spare (along with a handy info- 
graphic explaining the narrative structure of 
the 2010 film Inception). 

Ifshe sounds too perfect, she is. Eunice is an 
autonomous personal artificial-intelligence 
assistant, developed with technology 
mysteriously ahead of her time and thrown 
into the lap of the novel’s real protagonist, 
beta tester Verity Jane. Agencyis a feverish tour 
through worlds where the need to harness the 
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latest technological advances to avert or miti- 
gate global catastrophes overwhelms all else. 

To understand and fulfil her mission, Eunice 
needs Verity. She must also evade meddling 
from the company that dispatched her. Eunice 
puts all sorts of high-tech measures in place 
to prevent the company from listening in on 
her conversations with Verity and finding out 
about her elaborate plans to free herself from 
its clutches. She arranges deep-faked artificial 
speech generation to mask discussions inthe 
pair’s apartment. She hires freelancers to make 
state-of-the-art drones that track stalkers. 
When the time comes, she even disappears, 
distributing herself into many parts to ready 
herself for a reboot. 

Needless to say, all this leaves Verity 
perplexed — the only emotion she ever 
displays. It is hard to feel invested in a 
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character so devoid of personality. But the 
alternative for Verity, we soon realize, might 
be constant terror. Her world is beset by daily 
climate catastrophes and is on the brink of 
nuclear war. She lives, we find, ona timeline 
that branched off from our ‘real’ onein 2015. 
Mindbendingly, Gibson sets our own time- 
line more than two centuries into the future. 
Agency shuttles between the two. 

With this, we realize that we’re back in 
the world of The Peripheral (2014), Gibson’s 
previous novel. In that story, he developed 
a clever take on time travel that avoids the 
usual paradoxes. Somewhere in our future, 
cybertechnology has advanced to a point 
where it is possible to send digital messages 
tothe past. Once a message is received, anew 
timeline branches off. Gibson dubs these 
‘stubs’. With a digital link established, it is 
even possible to operate, from one timeline, 
a physical telepresence — a machine, robot or 
cyborg avatar called a peripheral — in another. 


AGENCY 


Agency 
William Gibson 
Berkley (2020) 


GETTY 


Who creates the stubs? It’s never fully clear. 
But we find that Verity’s was maliciously 
crafted to cause misery to humanity. Yet, in 
it, Donald Trump never became US presi- 
dent, and the United Kingdom’s referendum 
on whether to stay inthe European Union was 
won by Remain. The irony is notable. 

But our timeline, we find, has fared no 
better. Glimpses of a London inthe twenty-sec- 
ond century reveal humanity barely surviv- 
ing after severe political instability, climate 
change, rising poverty and organized crime. 
The only reason our species has hung onis that 
crucial technological advances arrived justin 
time. Througha peripheral, Verity — in her sep- 
arate timeline — encounters characters in this 
future London attempting to steer our world 
towards aless disastrous outcome. 

After this unusual meeting, the action 
becomes less interesting. Gibson’s portrayal of 
areality free from Trump and Brexit, yet beset 
by catastrophe, amounts to little. We occasion- 
ally hear about the television appearances of 
a female president, presumably modelled on 
Hillary Clinton, which show her as calm and 
capable in the face of every disaster. When 
Agency was trailed, much was made of this 
scenario. But despite the big questions it 
raises, the concept remains underdeveloped 
inthe story. 

Overall, however, the narrative is complex 
and intriguing, even if the cast of dozens — 
brought together for niche specialisms from 
designing drones to organizing events — isn’t 
always. Along with Eunice, an exception is 
Ainsley Lowbeer. I desperately wanted to 
hear more about this semi-mythical special 
detective from future London. She owns a 
self-cloaking car and can make space empty 
itself around her. And she is the main force 
behind attempts to orchestrate small changes 
to other timelines to help stabilize them: for 
instance, by nudging Eunice to greater agency. 

The dialogue is snappy, the technospeak 
relentless. Although character development 
is scant, the interaction between humans and 
advanced technology (such as Verity com- 
muning with Eunice through a set of smart 
glasses) is described in fascinating detail. In 
fact, Agency offers a refreshing variation on 
the usual limited artificial-intelligence trope. 

Eunice has a skill set based on that of an 
African American naval official, Marlene 
Miller. She declares herself an independent 
global citizen, and never bothers to ponder 
the question of whether she is sentient or not. 

All Eunice knows is that she is herself. Most 
of all, she has agency, the capability to act, 
and she doesn’t waste a moment in doing so 
throughout. And that is important. Gibson’s 
message seems to be: prepare for the worst, 
and get technology on your side. 


Liesbeth Venema is chief editor of Nature 
Machine Intelligence. 
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THE SCIENCE OF Who WE TRUST 


anv What we weLieve 


HUGO MERCIER 


Not Born Yesterday 

Hugo Mercier Princeton Univ. Press (2019) 

We're a hard species to dupe, argues cognitive scientist Hugo Mercier in 
this persuasive study on the science of belief. Drawing on research and 
historical case studies from the French Revolution to the Nazis, Mercier 
demonstrates that we have powerful cognitive mechanisms that 
enable us to weigh up a range of cues. Thus governmental propaganda 
succeeds by riding on existing opinion, not changing it. And vigilance, 
not credulity, seems to make false rumours (so often about threats) 
stick. A bracing book that might make you less gullible about gullibility. 


THE STORY 
OF CHINA'S 


INSTANT } 
cITY 


The Shenzhen Experiment 

Juan Du Harvard Univ. Press (2020) 

Just 40 years after then-Chinese premier Deng Xiaoping proposed 

to make Shenzhen a special economic zone, it has become a 
conurbation of 20 million, a technology hub and a symbol of China’s 
economic transformation. But as urban planner Juan Du shows in this 
deep dive of a history, the ‘instant city’ narrative is a myth. Sweeping 
aside slick origin stories, Du reveals a reality in which Shenzhen’s 
prosperity is driven by oyster fishers, vibrant night markets and the 
aspirations of millions, not just by the policymakers of Beijing. 


Daniel J. Levitin 


couthor of This Is Your Brain on Music 


Successful 


A Neuroscientist Explores the 
Power and Potential of Our Lives 


Successful Aging 

Daniel J. Levitin Dutton (2020) 

Neuroscientist Daniel Levitin reveals the ‘dreaded’ age of 60 to bea 
developmental stage with specific advantages, and a launchpad for 
productive elderhood. Looking in turn at the latest neuroscience, 
behavioural studies and findings on longevity and cognitive 
enhancement, Levitin delves into the multiple-trace theory of 
memory, the ageing microbiome, fats and the brain, the impacts 

of neural implants, and the joys of non-retirement. A clear-eyed, 
insightful overview of the neurophysiological healthspan. 


Cook, Taste;-Learn 
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Cook, Taste, Learn 

Guy Crosby Columbia Univ. Press (2019) 

This chronology of culinary science was part-inspired, nutritionist Guy 
Crosby tells us, by primatologist Richard Wrangham’s theories on how 
fire and cooking influenced human evolution. Those are contested. 
Nevertheless, Crosby’s historical tour of kitchen physics and chemistry 
is a sprightly delight. Here are invitations to peer into the architecture 
of “optimally kneaded dough”, goggle at the oldest known recipe (on 
a 3,750-year-old clay tablet from Babylonia), wonder at a seventeenth- 
century pressure cooker and contemplate the terroir of Vidalia onions. 


The 
True Creator 


The True Creator of Everything 

Miguel Nicolelis Yale Univ. Press (2020) 

The human brain, contends neuroscientist Miguel Nicolelis, is a 
peerless “organic computing device” that has authored a singular 
cosmos: human culture, from cave art to robotics. Within a framework 
he dubs relativistic brain theory, Nicolelis launches a mind-bending 
journey through neurological evolution, electromagnetism in relation 
to the brain, brain-machine interfaces and beyond. He warns, 
however, that our compulsion to monetize, and our addiction to 
“digital logic”, jeopardize our future. Barbara Kiser 
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ILLUSTRATION BY DAVID PARKINS 


Setting the agenda in research 


Comment 


Check for publication 
integrity before misconduct 


Andrew Grey, Mark J. Bolland, Alison Avenell, Andrew A. Klein & C. K. Gunsalus 


A tool that focuses on papers 
— not researcher behaviour 
—canhelp readers, editors 
and institutions assess which 
publications to trust. 


fit is published in the scientific literature, 
can you trust it? All too often, that ques- 
tion gets lost, sidetracked or buried. Even 
when serious, credible concerns are sent 
to ajournal, decisions over whether to cor- 
rect or retract are more likely to take years than 
months — time during which potentially harm- 
ful misinformation can spread. Delays and 
inaction often happen because enquiries tend 
to focus on the thorny question of whether a 
researcher acted deliberately to deceive. The 
moreimportant issue, however, is the integrity 
of the actual publication: isthe research article 
reliable and are its conclusions valid? 

As researchers who have spent years 
enmeshed in investigations, we think a major 
obstacle to evaluating the integrity of publi- 
cations is a lack of tools. The Committee on 
Publication Ethics (COPE) advises publishers 
to retract articles when there is “clear evidence 
that the findings are unreliable”, but does not 
advise on howto determine whether that is the 
case. Resources for editors also focus on how 
to manage communications, rather than on 
howto evaluate reliability and validity. The net 
effect is inaction: readers remain uninformed 
about potential problems with a paper, and 
that can lead to wasted time and resources, 
and sometimes put patients at risk. 

The integrity of a publication can be com- 
promised in many ways. Some are uninten- 
tional: typos, transcription errors or incorrect 
analyses. Others are deliberate: image manip- 
ulation, data falsification and plagiarism’. 
How publication integrity was compromised 
is secondary to whether the paper is reliable. 
Unreliable data or conclusions are problems 
irrespective of the cause. 

Enter checklists, which have helped in the 


‘> 


R, T 


structuring of complex procedures in health 
care and other industries. One of us (C.K.G.) 
helped to develop a checklist for assessing 
the quality of institutional investigations of 
researchers’ conduct?. Academic publishing 
has also introduced a suite of checklists to 
be completed by authors when submitting 
papers, to make sure certain aspects of the 
research are fully reported. 

Here we present a tool — the REAPPRAISED 
checklist — that aims to help readers, journal 
editors and anyone else assess whether a paper 
has flaws that call its integrity into question. 
We developed it on the basis of our own experi- 
ence and extensive consultation with research 
administrators and journal editors. Although 
designed for clinical and animal studies, the 
structured approach to investigation applies 
more broadly. Readers, peer reviewers, jour- 
nals, publishers and institutions can use it to 
assess whether to trust a paper’s findings. Our 
checklist should not be confused with jour- 
nals’ submission checklists, which are filled 
in by authors before publication and indicate 
what items are reported inthe manuscript. The 
REAPPRAISED checklist can be used by anyone 
struggling to assess a submitted or published 
article, and includes common-sense assess- 
ments that go beyond the text itself. Itcan, and 
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should, be applied independently of whether 
misconduct is suspected. Its use can help to 
speed up the identification and correction of 
flawed papers, preventing wasted resources 
and even protecting patients from harm. 


Cause for alarm 


How did wecometosee the need for this tool? 
From early 2013, three of us (A.A.,A.G., M.J.B.) 
began to contact journals about multiple, seri- 
ous problems we had identified in 33 reports of 
trials led by bone-health researchers Yoshihiro 
Sato andJun Iwamoto’. The first retraction did 
not appear until late 2015. 

This delay is all the more regrettable given 
that concerns had been raised more than a 
decade earlier. In 2003, a publication by Sato* 
assessed avery rare complication of treatment 
of Parkinson’s disease. Withina year, aletter° to 
the editor remarked on how surprising it was 
that the research group had managed to iden- 
tify 40 people with this complication ina very 
short time, because the writer’s specialist insti- 
tute had seen onlytwo cases “inliving memory”. 
Others had brought up concerns about ethical 
oversight®, as well as failure of randomization, 
implausible recruitment and outcomes, and 
other problems’, but no editorial comment or 
correction occurred at that time; retraction 
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THE ‘REAPPRAISED' CHECKLIST FOR 


EVALUATION OF PUBLICATION INTEGRITY 


Not all items will be applicable to every publication, and other 
questions might be relevant for individual categories. 


R— Research governance 

C1 Are the locations where the research took place specified, 
and is this information plausible? 

C1 Is a funding source reported? 

C1) Has the study been registered? 

0 Are details such as dates and study methods in the publication 
consistent with those in the registration documents? 


E— Ethics 

(J Is there evidence that the work has been approved by 
a specific, recognized committee? 

OJ Are there any concerns about unethical practice? 


C1 ‘P-hacking’: biased or selective analyses that promote 

fragile results 

CJ Other unacknowledged multiple statistical testing 

C1] Is there outcome switching — that is, do the analysis and discussion 
focus on measures other than those specified in registered analysis 
plans? 


1 — Image manipulation 
(J Is there evidence of manipulation or duplication of images? 


S — Statistics and data 
C1 Are any data impossible? 
CJ Are subgroup means incompatible with those for 


A— Authorship 

C1 Do all authors meet criteria for authorship? 

OJ Are contributorship statements present? 

OJ Are contributorship statements complete? 

C1] Is authorship of related papers consistent? 

OJ Can co-authors attest to the reliability of the paper? 


the whole cohort? 

CJ Are the reported summary data compatible with the 
reported range? 

CJ Are the summary outcome data identical across study 
groups? 

CJ Are there any discrepancies between data reported in 
figures, tables and text? 

C1 Are statistical test results compatible with reported data? 


P — Productivity 

C1 Is the volume of work reported by research group plausible, 
including that indicated by concurrent studies from the same group? 

C1 Is the reported staffing adequate for the study conduct as reported? 


OJ Are any data implausible? 

ZC Are any of the baseline data excessively similar or different 
between randomized groups? 

CJ Are any of the outcome data unexpected outliers? 

ZC Are the frequencies of the outcomes unusual? 


P — Plagiarism 

OJ Is there evidence of copied work? 

C1 Is there evidence of text recycling (cutting and pasting text between 
papers), including text that is inconsistent with the study? 


ZC Are any data outside the expected range for sex, 

age or disease? 

CJ Are there any discrepancies between the values for 
percentage and absolute change? 

CJ Are there any discrepancies between reported data and 
participant inclusion criteria? 

(Are the variances in biological variables surprisingly 


R — Research conduct 

C1] Is the recruitment of participants plausible within the stated time 
frame for the research? 

OJ Is the recruitment of participants plausible considering the 
epidemiology of the disease in the area of the study location? 

OJ Do the numbers of animals purchased and housed align with 
numbers in the publication? 

C1] Is the number of participant withdrawals compatible with the 
disease, age and timeline? 

C1 Is the number of participant deaths compatible with 
the disease, age and timeline? 

C1 Is the interval between study completion and 
manuscript submission plausible? 

OJ Could the study plausibly be completed as described? 


consistent over time? 


E—Errors 

C1) Are correct units reported? 

C1 Are numbers of participants correct and consistent throughout the 
publication? 

C1 Are calculations of proportions and percentages correct? 

7 Are results internally consistent? 

C1 Are the results of statistical testing internally consistent and 
plausible? 

OJ Are other data errors present? 

OJ Are there typographical errors? 


A— Analyses and methods 
C1 Are the study methods plausible, at the location specified? 
OJ Have the correct analyses been undertaken and reported? 
C1] Is there evidence of poor methodology, including: 

XC Missing data 

CZ] Inappropriate data handling 
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D — Data duplication and reporting 

CJ Have the data been published elsewhere? 

OJ Is any duplicate reporting acknowledged or explained? 

C1 How many data are duplicate reported? 

C1 Are duplicate-reported data consistent between publications? 
C1 Are relevant methods consistent between publications? 

C1 Is there evidence of duplication of figures? 
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notices were finally issued in 2015-18. 

Even after Sato admitted* in 2016 to making 
up data, only 2 of the 34 journals we contacted 
took the initiative to assess other papers they 
had published by this group. 

Investigations conducted at the four institu- 
tions at which Sato and Iwamoto worked were 
also, in our opinion, misdirected and incom- 
plete. They focused on identifying researcher 
misconduct, not publication reliability. They 
assessed only 90 of 351 potentially compro- 
mised publications, and only after we contacted 
them in 2017 with concerns’. Two institutions 
failed to reach aconclusion about the reliability 
of 56 publications of the 78 they considered, 
because they could not determine whether 
misconduct had occurred. The other two did 
not report assessments on individual papers. 
(The institutions maintain that their investiga- 
tions were appropriate, but have not responded 
directly to the criticisms, and two stated that 
it was difficult to assess publication integrity 
independently of misconduct.) 

So far, at least 90 papers by Sato or Iwamoto 
have been retracted. More than twice that 
number remain in the literature, including 5 
of the 33 clinical-trial reports. Readers have 
no way of knowing whether those reports are 
trustworthy. Nor do they have any guidance 
for making their own assessments. 

These papers continue to be influential. In 
the years since we brought our concerns to 
journals, those 33 clinical-trial reports have 
been cited more than 600 times. They have 
been used by other researchers as evidence to 
help justify at least eight clinical trials’°. Others 
that were later retracted were used in an effec- 
tiveness review conducted by the US Agency 
for Healthcare Research and Quality. They pro- 
vided the sole evidence that bisphosphonates, 
commonly used for osteoporosis, could pre- 
vent fractures in patients at high risk of falls". 

Other cases had similar outcomes. More 
than ten years elapsed between the initial 
notification of concerns about publications 
of clinical studies in anaesthesia by Yosh- 
itaka Fujii and the first of 183 retractions 
(see go.nature.com/2pvw2ax). Concerns 
raised about problems in a paper by Andrew 
Wakefield on autism and vaccination did not 
result in full retraction until 12 years after 
its publication. Flawed papers such as these 
influence future research initiatives, are incor- 
porated into clinical guidelines and influence 
medical practice and public perceptions, so 
could potentially harm millions of patients. 


Improving assessment 


Our REAPPRAISED checklist facilitates sys- 
tematic evaluation through 11 categories 
(see ‘The ‘REAPPRAISED list for evaluation 
of publication integrity’). It covers ethical 
oversight and funding, research productivity 
and investigator workload, validity of random- 
ization, plausibility of results and duplicate 


data reporting. It can identify problems from 
isolated data errors to data fabrication or fal- 
sification. Some of these questions should 
ideally have been asked by reviewers and edi- 
tors before publication, and not all questions 
will apply to every paper, butitis still useful to 
consider all the questions collectively when 
assessing an article. 

We developed the checklist in our roles as 
researchers (A.G., M.J.B., A.A.) and journal 
editor (A.A.K.) while evaluating thousands 
of publications; C.K.G., a specialist in institu- 
tional investigations, helped to refine it. 

Our experience suggests that using 
REAPPRAISED can assist in decision-making 
before and after publication. A.A.K. imple- 
mented an earlier version of the checklist as 
editor-in-chief of Anaesthesia, which reviews 
nearly 1,000 submissions per year. Each is 
screened with the checklist; if concerns arise, 
individual patient data are requested and 
reviewed carefully for errors, inconsisten- 
cies or other red flags. In the past two years of 
routine use, editors have identified integrity 
problems in 42 submissions. A large subset 
of these — comprising work from 12 research 
groups — was serious enough to notify authors’ 
institutions. Six of the 12 ensuing investigations 
confirmed that data had been falsified or fab- 
ricated; two laid the blame on errors; and four 
are ongoing (A.A.K., personal communication). 

We have also found this checklist effec- 
tive in communicating with journals, having 
used it to submit a structured list of concerns 
to journal editors for the 56 publications 
for which institutional investigations were 
inconclusive. That has led to 29 retractions 
and 5 expressions of concern. 


Implementation 


We would like to see the checklist used during 
both manuscript review and post-publication 
evaluation. The fact that it separates assess- 
ment of publication integrity from the inves- 
tigation of research misconduct will speed up 
evaluations. It could even be published along- 
side decisions to retract, issue an expression of 
concern, correct a paper, or let it stand. 

We expect that use of REAPPRAISED will 
lead to more detailed, efficient, consistent 
and transparent evaluations of publication 
integrity, and thus faster and more accurate 
reporting of corrections and retractions. These 
improvements will benefit the researchers, 
clinicians, policymakers, patients and others 
who rely on the literature to make decisions. 
People using the tool will be able to help refine 
it as they gain experience, andit will help them 
to develop standards to assess the integrity of 
publications and act accordingly. We hope 
others will join in our efforts to implement 
and refine REAPPRAISED both informally and 
in future publications. 

Integrity problems often cluster. Authors 
who havea paper retracted for misconduct are 
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more likely to accrue multiple retractions than 
are those whose work was retracted for other 
reasons", Publications by their co-authors can 
also be compromised”. The checklist could 
help journal staff and investigation commit- 
tees to decide when an assessment should 
be broadened to include other papers by a 
particular researcher and collaborators. 

Ironically, a checklist that puts aside the 
question of misconduct might aid in evalu- 
ations of inappropriate behaviour. If multi- 
ple concerns are identified, or the concerns 
identified are those often associated with mis- 
conduct, the entire body of an author’s work 
should be systematically assessed. 

Publishers’ integrity groups should adopt 
the checklist (or ask those expressing concerns 
to doso). Funders and government regulators 
should disseminate it to publishers, research 
institutions and other stakeholders. Peer 
reviewers and readers can use it on their own 
initiative, and those who have anagging feeling 
about a publication can use it to work through 
their concerns and, if merited, communicate 
those to ajournal. 

If the goal is trustworthy literature, the 
integrity of publications — not just determi- 
nation of misconduct — should be the focus 
of investigation. 
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Readers respond 


Correspondence 


Brazil’s biofuel plans 
drive deforestation 


The scientific community, 
Brazil’s policymakers and the 
public need to take coordinated 
action against plans to 

amplify biofuel production 

at the expense of the Amazon 
rainforest. Brazilian President 
Jair Bolsonaro lifted the ban on 
sugar-cane cultivation inthe 
Amazon by decree in November 
2019 to help boost the country’s 
biofuel production. Plans to 
produce more oil palm for 
biofuel in the state of Roraima 
follow the inauguration of a 


processing plant there last April. 


And in the west of the state of 
Amazonas, new roads could 
open up previously inaccessible 
areas for palm plantations and 
drive further devastating cycles 
of deforestation. 

After the president took over 
in January 2019 (C. A. Nobre 
Nature 574, 455; 2019), a 
government consortium 
announced investments of 
4.4 billion reais (US$1.1 billion) 
in six Amazon states — 
Amazonas, Acre, Amapa, Mato 
Grosso, Rond6nia and Roraima 
— for the installation of power 
plants fuelled by maize ethanol. 
Maize ethanol was chosen 
because of the ban on sugar- 
cane production, introduced to 
curb deforestation and the loss 
of ecosystem services essential 
to Brazil’s agriculture and to 
mitigate global climate change 
(L. Ferrante and P. M. Fearnside 
Science 359, 1476; 2018). 

Lifting the ban adds tothe 
already huge pressure on the 
rainforest from cattle ranching, 
soya-bean production, 
hydroelectric dams and mining. 


Social media sows 
ecology consensus 


All too often, social media 
deepens divisions rather than 
bridges them. But we have 

hada refreshingly different 
experience: through an open 
call on Twitter, we assembled 

a global team of more than 

50 authors at different career 
stages and froma range of 
scientific disciplines. We aim to 
build a consensus on the drivers 
of ecological disturbances and 
their impact (see go.nature. 
com/2pwwrrb). 

Disturbances suchas 
wildfires, hurricanes and 
sea-level rise have long- 
lasting implications that 
can fundamentally alter the 
structure and functions of an 
ecosystem. Understanding 
their complex causes and 
consequences requires rigorous 
analysis of multi-stressor 
interactions. To this end, 
we developed an integrated 
concept of disturbance ecology 
that is independent of the 
scale of the investigation. This 
method draws together ideas 
from disciplines such as data 
science, molecular ecology and 
palaeoecology. Our approaches 
are based on input from places 
as ecologically varied as 
Argentina, Luxembourg and 
Singapore. They fulfil minimum 
reporting standards and aim 
to meet key requirements 
for understanding long-term 
socio-ecological impacts. 

We have found that 
social media can stimulate 
open science and attract 
contributions from researchers, 
irrespective of their career stage 
or financial support. 
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In defence of the 
paper census 


We agree that hard-copy 
censuses are crucial to public- 
health surveys in developing 
economies (see Nature 574, 296; 
2019). In our view, they should 
be continued and not replaced 
by online ‘big data’ collection. 
Furthermore, your suggestion 
that non-governmental 
organizations (NGOs) — as 
opposed to governments — 
could carry out censuses would 
pose several risks. 

For example, NGOs would be 
unlikely to guarantee country- 
level coverage, uniformity in 
data collection, periodicity or 
commitment to conservation 
and oversight of data access. 
That could seriously impair the 
availability, quality, long-term 
observability and accessibility 
of complete national data sets. 
Collection of data online only 
is a limiting factor, especially 
for the most disadvantaged 
segments of the population. 
Although electronic gathering 
aided by collectors might 
improve coverage, it could also 
breach rights to privacy. 

More frequent sampling, as 
envisaged by some countries, 
would require data sets to be 
catalogued year on year for 
comparative purposes, raising 
the possibility of missed or 
double-counted entries. 

We urge caution. All too often, 
governments are willing to 
renege on their responsibilities 
to citizens — particularly if 
NGOs are poised to meet the 
challenge. The work of Roy 
Burstein and his colleagues in 
tracking child mortality (Nature 
574, 353-358; 2019) would not 
have been possible without the 
granularity of household-level 
census returns. 
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© 2020 Springer Nature Limited. All rights reserved. 


Consider context to 
avert catastrophe 


Alisha Graves and her colleagues 
suggest that catastrophein 
Africa’s Sahel region could 

be averted by boosting food 
production and personal 
security, along with improving 
efforts to slow population 
growth (Nature 575, 282-286; 
2019). In my view, however, 
their suggestions overlook 
the realities of conflicts onthe 
ground. 

Increasing military spending 
can, as they themselves point 
out, aggravate the problem. 
Heavy-handed responses by 
the army, such as scorched- 
earth tactics and human- 
rights abuses, undermine 
communities’ ability to cope 
with the effects of climate 
change and risk driving 
up recruitment to armed 
opposition groups (see https:// 
shoring-up-stability.org/). Even 
national governments in the 
region accept the need to step 
away from military force and 
move towards governance that 
can tackle problems. 

The authors suggest that 
more international aid could 
address the shortfall in 
basic services (in health and 
education, for example) that 
is resulting from increased 
military spending. Evidence 
from the region points to the 
contrary: failure of governance 
and service provision by 
national and local governments 
stoked initial grievances, which 
continue to drive conflict. 
Increased dependence on 
foreign aid could lock in this 
problem. 

Bringing science and policy 
together can inform better 
responses, but these need to 
take the context and its people 
as the starting point. 


Janani Vivekananda Adelphi, 
Berlin, Germany. 
vivekananda@adelphi.de 
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Controllable strain 
enhances a semiconductor 


Jian Shi 


Semiconductors known as halide perovskites have remarkable 
optoelectronic properties, but their structural instability 
limits practical applications. A solution has been found that 
involves squeezing the compounds crystal lattices. See p.209 


The surprising discovery’ in 2009 that 
compounds known as halide perovskites can 
convert sunlight into electricity triggered 
a revolution in photovoltaics (solar cells), 
inspiring new cell designs that will enable 
solar energy to be harnessed efficiently and 
at low cost. However, tuning the properties 
of these crystals to enable practical applica- 
tions has been a long-standing challenge. On 
page 209, Chenetal.’ report that asolutionto 
this problem has finally been found. 

Discovering new materials and identify- 
ing appropriate applications can sometimes 
be achieved serendipitously, but often 
takes decades, and has historically required 
centuries or even millennia. The first halide 
perovskite was discovered in the 1890s’, but 
its potential remained untapped until a decade 
ago’, when it took photovoltaics by storm. 
As its name suggests, halogen atoms in the 
compound enable the formation of a cubic 
(or pseudocubic) array knownasa perovskite 
structure. 

Harnessing the full potential of halide 
perovskites for technological applications 
has been difficult. A major obstacle is the 
tendency of one of the best-performing per- 
ovskite crystals, a-formamidinium lead iodide 
(HC(NH,),PbI,, known as a-FAPDI,), to assume 
a hexagonal structure at room temperature 
(Fig. 1a) — the approximate temperature at 
which photovoltaic devices operate. This 
hexagonal structure cannot respond to most 
of the frequencies of light in solar radiation, 
and hence is not of interest for technological 
applications. It would therefore be helpful to 
stabilize the structure of a-FAPbI,. 

Several strategies can be used to engineer a 
material’s properties. Two of the most effec- 
tive are to alter the material’s composition or 
to use it at high or low temperatures, but the 


Cubic structure 
of a-FAPbl, 


Strained 
lattice 


Substrate 


Figure 1| The stabilization and strain engineering 
of asemiconductor. a, The semiconductor 
a-formamidinium lead iodide (HC(NH,),PbI, known 
as a-FAPbI,) belongs to the halide perovskite family 
of compounds, and has potential applications in 
solar cells. However, its cubic perovskite lattice 

is unstable, and converts to a hexagonal form 

that is unsuitable for practical applications. The 
octahedra represent subunits of the lattices: iodine 
atoms at the vertices surround a central lead atom; 
(HC(NH,),)* ions fill the gaps between octahedra, 
but are not shown. b, Chen et al.? report that the 
structure of a-FAPbI, can be stabilized by growing 

it ona stable halide perovskite (the substrate) that 
has an analogous structure, so that the atoms in the 
two lattices align. Because the lattice dimensions 

of the substrate are smaller than those of a-FAPbI,, 
the crystal lattice of the latter is squeezed (put 
under strain). This strain increases the mobility of 
electrical charge carriers in the material. 
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costs involved for commercial applications 
can be tremendous. Scientists have there- 
fore developed a simple but extremely useful 
approach knownas strain engineering, which 
has been used to tune the electronic properties 
of semiconductors. 

When acrystal is compressed or stretched, 
the resulting deformation is called strain. 
Strain is calculated by dividing the change of 
length of a deformed object by the object’s 
original length, and is expressed as a percent- 
age of the original length. Insemiconductors, 
strain can alter the mobility of charge carri- 
ers — aproperty that characterizes how fasta 
charge carrier, such as an electron, can travel 
inacrystal subjected to an electric field. By 
changing the charge mobility, the electronic 
properties of asemiconductor can be altered. 

Semiconductors in modern electronic 
devices are most commonly used as thin 
films, and the maximum sustainable strain of 
a film is less than a few per cent*. Neverthe- 
less, strain modulation can be a highly effec- 
tive tool for enhancing electronic properties. 
One such success story is that of lasers based 
onlayered semiconductor structures knownas 
quantum wells. By straining the quantum-well 
layers, the electrical current needed to power 
alaser can be reduced by as muchas tentimes, 
thus improving the energy efficiency of the 
devices**. Most commercial quantum-well 
lasers are therefore strained’. 

One of the most interesting but problematic 
features of halide perovskites is that only a 
few, including a-FAPbI,, have high charge 
mobility and absorb light strongly, but the 
technologically useful crystal structures of 
these few compounds are unstable — some can 
spontaneously transform into other phases in 
less than a second. In photovoltaic devices, 
fast-moving charge carriers and strong light 
absorption are both needed to convert solar 
energy into electricity with high efficiency. 
Unfortunately, structural stability, high charge 
mobility and the ability to absorb light strongly 
don’t seem to coexist in perovskite halides. 

Chen et al. have therefore used strain 
engineering to tackle this problem. They grew 
crystalline a-FAPbI, from a solution so that it 
formed on another, more stable halide per- 
ovskite (the substrate). The FAPbI, atoms inthe 
growing crystal align with the cubic structure 
of the atoms inthe substrate, thereby forming 
a pseudocubic structure themselves (Fig. 1b). 
The alignment of atoms in different materials 
is called epitaxy. This epitaxy locks a-FAPbI, 
into the pseudocubic structure asa result of the 
strong chemical forces between it and the sub- 
strate, preventing its transformation into the 
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undesirable hexagonal structure. The authors 
find that the pseudocubic structure remains 
stable for at least a year at room temperature. 

A compressive strain is imposed on the 
a-FAPDbI, film because the dimensions of the 
cubic array of the substrate are different from 
those of the natural atomic array of a-FAPbI,. 
Chen and colleagues were therefore able to 
control the strain of x-FAPbI, from 0 to 2.4% 
compressive deformation by growing FAPbI, 
on substrates that have different lattice dimen- 
sions. The authors found that this squeezing 
of the a-FAPbI, crystal increases the mobility 
of positively charged quasiparticles called 
holes, which correspond to the absence of 
electrons inthe crystal. The authors attribute 
this increased mobility to the modification of 
the electronic structure of the crystal under 
compressive strain: compression leads to 
faster oscillations of the holes’ wavefunctions, 
speeding up the movement of charge wave- 
packets (superpositions of wavefunctions) 
and thus producing higher charge mobility. 

Previous work on the strain engineering of 
halide perovskite films lacked strain control’ 
orinvolved straining methods that are harder 
to use®’. By contrast, Chen and colleagues’ 
study provides an extremely accessible and 
practical avenue through which to explore and 
use the physical properties of strained halide 
perovskites. 

Questions remain about how the authors’ 
findings could be used in solar cells. Currently 
available halide perovskite photovoltaic 
devices do not contain a genuinely epitaxial 
substrate, and so new cell designs will be 
needed to make use of the reported discovery. 
But a range of halide perovskite compounds 
are available that have similar atomic arrays 
to a-FAPbI,, and which exhibit many different 
technologically important electronic proper- 
ties. Chen and co-workers’ study therefore sug- 
gests that there is plenty of scope for designing 
and developing epitaxial quantum-well devices 
using these materials, by mimicking the way in 
which quantum-well devices were developed 
using semiconductors from the III-V family 
of materials. This might bring down the cost 
of manufacturing these devices. 

Finally, it will be interesting to see whether 
crystals of halide perovskites can be grown 
with sufficient atomic precision to make 
superlattices — periodic structures that con- 
tain multiple layers of two or more materials. 
The use of halide perovskites in superlattices 
could open up otherwise inaccessible elec- 
tronic band structures, thereby allowing a rich 
array of physics to be explored, and emerging 
quantum-well devices to be further developed. 
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Predator-prey cycles 


achieved at last 


Alan Hastings 


Acombination of laboratory experiments and mathematical 
and statistical analysis provides an affirmative answer toa 
decades-old question — cana predator and its prey coexist 


indefinitely? See p.226 


Akey question in ecology is what allows species 
to persist over time — particularly when there 
are pairs of species in which oneis an exploiter 
and the other its victim. A long-standing theory 
attempts to answer this question by explaining 
how relative numbers of predators and their 
prey can cycle continuously’. First, prey num- 
bers would increase, giving the predator more 
food. The subsequent increase in predators 
would lead toa decline in prey. Predator num- 
bers would then decline owing toa lack of food, 
restarting the cycle. However, it has proved 
unexpectedly challenging to demonstrate 
this type of persistent predator-prey cyclein 
simple controlled systems in the laboratory. 
On page 226, Blasius et al.” report just sucha 
demonstration, succeeding where almost 90 
years of experimental work has failed. 

Ecological theories of persistent predator-— 
prey cycles are supported by the apparent 
existence of such cycles in nature, for instance 
between the lynx and hare in Canada’. How- 
ever, it is hard to prove that these cycles 
endure in the wild, because observations 
over many decades would be needed. But if 
the theories provide a complete explanation 
of natural cycles, then it should be possible to 
demonstrate persistent cyclic behaviour in 
the laboratory, using species that have much 
shorter cycle times. 

The challenge posed by such a demonstra- 
tion was exemplified in 1934 by the ecologist 
Georgii Gause*, who studied the dynamics 
of two unicellular organisms — the predator 
Didinium nasutum and its prey, Paramecium 
caudatum. Gause found that, onthe one hand, 
if the predator was efficient, it ate up all the 
prey and then starved. On the other hand, if 
part of the environment helped to conceal 
the prey, the predator was less efficient — and 
so starved (Fig. 1a). Coexistence and long- 
term cycles could be achieved only through 
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artificial means — namely, by adding prey at 
regular intervals. 

In 1974, work with the same system showed 
that, by making the predator less efficient and 
by providing the prey with less food, the two 
populations could persist for longer°. Even so, 
coexistence could be maintained for just a few 
predator-prey cycles. Since then, some mod- 
els that allow long-term cycle persistence have 
focused on space, for instance incorporating 
metapopulation dynamics*. In this phenom- 
enon, subpopulations of a species migrate 
around a larger region. Although a subpopu- 
lation might become extinct in one area, the 
species persists across the region as a whole 
and can migrate back into that area. However, 
a better understanding of whether exploiter— 
victim cycles can persist locally without exter- 
nal input is still sorely needed. 

Blasius and colleagues studied the aquatic 
invertebrate Brachionus calyciflorus and its 
prey, the green algal species Monoraphidium 
minutum. They found that, under simple 
and constant environmental conditions, the 
two species could coexist for more than a 
year — that is, over SO predator-prey cycles. 
This result finally demonstrates that the 
long-standing theory of persistent cycles 
can be consistent with the reality of simple 
ecological systems. 

Next, the researchers carried out arigorous 
statistical analysis of the cycle dynamics in 
their system. Specifically, they used wavelet 
analysis, which focuses on dynamics over 
short periods; the technique has become 
a standard way to study the presence of 
periodic behaviour in ecological time series’. 
The analysis revealed interesting dynamic 
phenomena. The oscillations in the relative 
numbers of each species showed a characteris- 
tic lagin phase, with predator numbers mostly 
changing in the wake of altered prey numbers. 
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Predator-prey cycles 


achieved at last 


Alan Hastings 


A combination of laboratory experiments and mathematical 
and statistical analysis provides an affirmative answer toa 
decades-old question — cana predator and its prey coexist 


indefinitely? 


Akey question in ecology is what allows species 
to persist over time — particularly when there 
are pairs of species in which oneis an exploiter 
and the other its victim. A long-standing 
theory attempts to answer this question by 
explaining how relative numbers of preda- 
tors and their prey can cycle continuously’. 
First, prey numbers would increase, giving the 
predator more food. The subsequent increase 
in predators would lead to a decline in prey. 
Predator numbers would then decline owing 
toa lack of food, restarting the cycle. How- 
ever, it has proved unexpectedly challenging 
to demonstrate this type of persistent preda- 
tor-prey cyclein simple controlled systems in 
the laboratory. Writing in Nature, Blasius et al. 
reportjust such a demonstration, succeeding 
where almost 90 years of experimental work 
has failed. 

Ecological theories of persistent predator-— 
prey cycles are supported by the apparent 
existence of such cycles in nature, for instance 
between the lynx and hare in Canada*. How- 
ever, it is hard to prove that these cycles 
endure in the wild, because observations 
over many decades would be needed. But if 
the theories provide a complete explanation 
of natural cycles, then it should be possible to 
demonstrate persistent cyclic behaviour in 
the laboratory, using species that have much 
shorter cycle times. 

The challenge posed by such a demonstra- 
tion was exemplified in 1934 by the ecologist 
Georgii Gause*, who studied the dynamics 
of two unicellular organisms — the predator 
Didinium nasutum and its prey, Paramecium 
caudatum. Gause found that, onthe one hand, 
if the predator was efficient, it ate up all the 
prey and then starved. On the other hand, if 
part of the environment helped to conceal the 
prey, the predator was less efficient — and so 


starved (Fig. 1a). Coexistence and long-term 
cycles could be achieved only through artifi- 
cial means — namely, by adding prey at regular 
intervals. 

In 1974, work with the same system showed 
that, by making the predator less efficient and 
by providing the prey with less food, the two 
populations could persist for longer°. Even so, 
coexistence could be maintained for just afew 
predator-prey cycles. Since then, some mod- 
els that allow long-term cycle persistence have 
focused on space, for instance incorporating 
metapopulation dynamics*®. In this phenom- 
enon, subpopulations of a species migrate 
around a larger region. Although a subpopu- 
lation might become extinct in one area, the 
species persists across the region as a whole 
and can migrate back into that area. However, 
a better understanding of whether exploiter— 
victim cycles can persist locally without exter- 
nal input is still sorely needed. 
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Blasius and colleagues studied the aquatic 
invertebrate Brachionus calyciflorus and its 
prey, the green algal species Monoraphidium 
minutum. They found that, under simple 
and constant environmental conditions, the 
two species could coexist for more than a 
year — that is, over SO predator-prey cycles. 
This result finally demonstrates that the 
long-standing theory of persistent cycles 
can be consistent with the reality of simple 
ecological systems. 

Next, the researchers carried out arigorous 
statistical analysis of the cycle dynamics in 
their system. Specifically, they used wavelet 
analysis, which focuses on dynamics over 
short periods; the technique has become 
a standard way to study the presence of 
periodic behaviour in ecological time series’. 
The analysis revealed interesting dynamic 
phenomena. The oscillations in the relative 
numbers of each species showed a characteris- 
ticlagin phase, with predator numbers mostly 
changing inthe wake of altered prey numbers. 
However, these oscillations would undergo 
sudden shifts, without any external driver. 
During these transient periods, both popu- 
lations would oscillate out of sync with one 
another, before the in-phase cyclic dynamic 
resumed (Fig. 1b). 

The authors also performed experiments 
in which they introduced pulses of nutrients 
to the species’ environment. This mimics 
seasonal environmental changes experienced 
in many natural systems. The overall dynamics 
of the predator-prey cycle remained the same 
inthese conditions, indicating that predator- 
prey interactions can govern cycle dynamics, 
even ina seasonally varying environment. 

Although Blasius and colleagues’ work 


Cycles 
out of sync 


160 


Figure 1| A difficult balance to strike. Ecological theory predicts that the relative numbers of a predator and 
its prey should oscillate in persistent cycles, but demonstrating this in the laboratory has been hard. a, Classic 
experiments‘ from the 1930s failed to achieve cycles, because predators were either too inefficient at eating 
prey (not shown) or too efficient, and ate them all. Either way, predator numbers dwindled. b, Blasius et al.” 
have succeeded in achieving persistent predator-prey cycles in the laboratory. They found that cycles were 
mostly in phase, with predator cycles slightly lagging behind prey cycles. However, there were transient 
periods in which the two — for unknown reasons — fell out of sync. (Adapted from Fig. 2b of the paper.) 
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answers one question, it raises others. First 
and foremost, why are the results so differ- 
ent from those of earlier experiments? It is 
not just a quirk of the species used, because 
the authors repeated the experiment using a 
different prey species and achieved the same 
result. One contributing factor could be that 
the researchers used a predator that has acom- 
plex life history involving transitions through 
several stages, from egg to adulthood. Another 
could be that persistence depends on the 
specific amplitude of oscillation in the cycle. 

In support of these ideas, previous work 
involving a predator with a simpler life cycle 
demonstrated that populations of predators, 
prey or both went extinct if cycles reached 
an amplitude at which population numbers 
become low’. In addition, analysis of a dif- 
ferent marine predator-prey combination® 
showed that cyclic dynamics eventually gave 
way either to extinction or to persistence of 
the two populations in equilibrium. And math- 
ematical theory! suggests that the small-ampli- 
tude cycles observed by Blasius and colleagues 
would occur only under very specific condi- 
tions. Demonstrating persistent cycles in 
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other systems is therefore a key challenge for 
the future. The life-history characteristics of 
the predator used by Blasius and colleagues 
could be a starting point in the search for 
systems that show similar behaviour. 

Another intriguing avenue for future study 
is to determine the relationship between 
transient dynamics and switches in system 
behaviour. Does a sudden change in the 
dynamics of acycling system reflect changing 
environmental conditions, or is it anintrinsic 
phenomenon? The existence of unexpected, 
transient shifts in predator-prey dynamics 
limits the predictability of ecological systems”. 
In the real world, sudden shifts of this nature 
could spur an erroneous quest to identify 
drivers of the change, whereas the current 
study indicates that there may be none. The 
possibility of transient dynamics could also 
make it more difficult to predict the effect of 
ecological-management strategies. 

Blasius et al. have provided one of the 
clearest demonstrations so far of transient 
dynamics in the laboratory, where it is abso- 
lutely certain that no external influences 
caused the change. It will be important to bring 
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together this work (and another convincing 
experimental demonstration’’) with 
theoretical models that are currently being 
developed’. Doing so should provide more 
insights into the role of transient dynamics 
innatural systems. 
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Figure 1|A difficult balance to strike. Ecological theory predicts that the relative numbers of a predator and 
its prey should oscillate in persistent cycles, but demonstrating this in the laboratory has been hard. a, Classic 
experiments‘ from the 1930s failed to achieve cycles, because predators were either too inefficient at eating 


prey (not shown) or too efficient, and ate them all. Either way, predator numbers dwindled. b, Blasius et al.” 
have succeeded in achieving persistent predator-prey cycles inthe laboratory. They found that cycles were 
mostly in phase, with predator cycles slightly lagging behind prey cycles. However, there were transient 
periods in which the two — for unknown reasons — fell out of sync. (Adapted from Fig. 2b of the paper.) 


However, these oscillations would undergo 
sudden shifts, without any external driver. 
During these transient periods, both popu- 
lations would oscillate out of sync with one 
another, before the in-phase cyclic dynamic 
resumed (Fig. 1b). 

The authors also performed experiments 
in which they introduced pulses of nutrients 
to the species’ environment. This mimics 
seasonal environmental changes experienced 
in many natural systems. The overall dynamics 
of the predator-prey cycle remained the same 
inthese conditions, indicating that predator- 
prey interactions can govern cycle dynamics, 
even in a seasonally varying environment. 

Although Blasius and colleagues’ work 
answers one question, it raises others. First 
and foremost, why are the results so differ- 
ent from those of earlier experiments? It is 
not just a quirk of the species used, because 
the authors repeated the experiment using a 
different prey species and achieved the same 
result. One contributing factor could be that 
the researchers used a predator that has acom- 
plex life history involving transitions through 
several stages, from egg to adulthood. Another 
could be that persistence depends on the 
specific amplitude of oscillation in the cycle. 

In support of these ideas, previous work 
involving a predator with a simpler life cycle 
demonstrated that populations of predators, 
prey or both went extinct if cycles reached 
an amplitude at which population numbers 
become low*. In addition, analysis of a dif- 
ferent marine predator-prey combination® 
showed that cyclic dynamics eventually gave 
way either to extinction or to persistence of 
the two populations in equilibrium. And math- 
ematical theory’ suggests that the small-ampli- 
tude cycles observed by Blasius and colleagues 
would occur only under very specific condi- 
tions. Demonstrating persistent cycles in 
other systems is therefore a key challenge for 
the future. The life-history characteristics of 
the predator used by Blasius and colleagues 


could be a starting point in the search for 
systems that show similar behaviour. 
Another intriguing avenue for future study 
is to determine the relationship between 
transient dynamics and switches in system 
behaviour. Does a sudden change in the 
dynamics of acycling system reflect changing 
environmental conditions, or is it anintrinsic 
phenomenon? The existence of unexpected, 
transient shifts in predator-prey dynamics 
limits the predictability of ecological systems”. 
In the real world, sudden shifts of this nature 
could spur an erroneous quest to identify 
drivers of the change, whereas the current 
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study indicates that there may be none. The 
possibility of transient dynamics could also 
make it more difficult to predict the effect of 
ecological-management strategies. 

Blasius et al. have provided one of the 
clearest demonstrations so far of transient 
dynamics in the laboratory, where it is abso- 
lutely certain that no external influences 
caused the change. It will be important to bring 
together this work (and another convincing 
experimental demonstration’’) with 
theoretical models that are currently being 
developed’. Doing so should provide more 
insights into the role of transient dynamics 
innatural systems. 
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The physics 
ofice skating 


Daniel Bonn 


The slipperiness of ice is poorly understood at a microscopic 
level. Experiments that probe how the surface of ice melts and 
flows in response to wear help to explain the exceptionally low 
friction that underpins winter sports. 


Itis widely thought that ice skating is enabled 
by the formation ofa layer of water onthe sur- 
face of ice, which lubricates the skate. Writing 
in Physical Review X, Canale et al. report that, 
although there is indeed a layer of something, 
it is not simply water — the lubricating layer 
has surprising flow properties that are very 
different from those of bulk water, and are 
intermediate between those of liquid water 
andice. 

The idea that a thin film of meltwater wets 
the surface of ice has been accepted since 
the nineteenth century’. This layer should 
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be present beneath skates or other objects 
sliding on ice. Most of the debate about this 
topic, therefore, deals with the origin of the 
layer —isitthe result of surface melting, heat- 
ing associated with friction generated by the 
skate, or pressure-induced melting? Pressure 
melting has largely been discounted, because 
this process is impossible for ice below about 
-20 °C, whereas skating is possible at such 
low temperatures. Surface melting and fric- 
tional heating also seem unlikely explanations 
because these phenomena are not specific 
to solid water ice, yet ice is almost the only 
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Figure 1| Probing the lubricating layer of ice. Canale et al.’ investigated the thin layer of melt water that 
lubricates the slipping of objects onice. The authors attached a millimetre-scale glass bead to a prong ofa 
macroscopic aluminium tuning fork. The fork was made to vibrate, causing the bead to oscillate over the 

ice surface. An accelerometer on the fork measured the amplitude of the bead’s oscillations parallel and 
perpendicular to the surface. From these measurements, the friction between the bead and the ice, and the 
flow properties (rheology) of the lubrication layer, could be calculated. The observed viscosity and elasticity 
of the layer suggest that it consists of a mixture of water and ice particles. 


material that it is possible to skate on. 

Also unknown are the mechanical proper- 
ties of the meltwater layer. Whether a liquid 
lubricates or not is mostly determined by its 
resistance to being squeezed out from the 
gap between the two rubbing surfaces — thick 
grease is usually needed for good lubrication. 
Water is not a good lubricant, because its low 
viscosity means that it is easily squeezed out of 
gaps. The idea that a layer of water is sufficient 
to lubricate a skate oniceis therefore strange. 
It doesn’t even make intuitive sense, given that 
it is impossible to skate on aroad or akitchen 
floor witha layer of water on it. 

Canale et al. report clever experiments 
that investigate the microscopic mechanisms 
responsible for the slipperiness of ice (Fig. 1). 
The authors measured the coefficient of 
friction (a measure of the friction produced 
at a surface) of ice and the properties of the 
lubrication layer simultaneously, using an 
experimental set-up that resembles a large 
tuning fork. The fork was made to vibrate, so 
that a millimetre-scale glass bead attached to 
one of its prongs oscillated across an ice sur- 
face. The bead thus functioned as a tiny ice 
skate, gliding for distances of the order of tens 
of micrometres across the same region of ice. 

An accelerometer attached to the same 
prong of the fork as the bead measured the 
amplitude of the bead’s oscillations parallel 
to the surface, and compared them with the 
amplitude of the driving force. Canale et al. 
used the difference in amplitude to calculate 
the friction force between the bead and the ice. 
Simultaneously, tiny oscillations of the bead 
(restricted to just a fewtens of nanometres, so 
as not toinfluence the friction measurements) 
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perpendicular to the ice surface allowed the 
authors to probe the properties of the lubri- 
cation layer. The authors used a similar pro- 
cedure to the one used to measure parallel 
movements to measure the forces acting on 
the bead perpendicular to the surface. The 
coefficient of friction between glass and ice 
could then be calculated from the ratio of the 
two measured forces. 

Canale and colleagues’ procedure not only 
measured the amplitude change of the perpen- 
dicular movements, but also the time taken for 
the driving force to produce these oscillations 
of the bead (the phase shift). This allowed the 
authors to measure the mechanical imped- 
ance of the lubricating layer on the ice — a 
measure of its resistance to forces applied 
by the bead. These measurements, in turn, 
correlate with the flow properties (rheology) 
of the lubrication layer. 

Taken together, the authors’ measurements 
allowed the friction and rheology of the lubri- 
cation layer to be probed simultaneously. The 
experiments show that the lubrication layer 
has both viscous and elastic responses to an 
object sliding on top of it. This is not only differ- 
ent from the behaviour of bulk water, but also 
tothat ofice. The results, therefore, reveal that 
skating requires more than just a layer of water. 

So what is happening? Sophisticated spec- 
troscopy techniques have previously been 
used to study the surface ofice at different tem- 
peratures’, and revealed that melting occurs 
discontinuously through successive bilayers of 
ice molecules — the first bilayer melts at about 
-70°C, and the second one at -20°C. Friction 
measurements over that same temperature 
range have shown, however, that friction at the 
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surface doesn’t change as the bilayers melt*. 
It therefore seems that the slipperiness of ice 
does notjust depend onits surface properties 
during melting, but also onits interaction with 
the skate. 

Canale and co-workers’ experiments 
support that idea. The authors suggest that 
repeated sliding over the same spot generates 
amixture of ice and water, which displays both 
elastic behaviour (from the ice) and viscous 
behaviour (from the water) in response toa 
load. The resulting layer of material would 
be more difficult to squeeze out of gaps 
than ordinary water. This could — at least in 
part — explain the layer’s excellent lubrication 
properties. 

The newstudy also provides the beginning of 
an explanation of why skating is so specific to 
ice. Lubricated friction and concomitant wear 
often lead to the formation of a layer of differ- 
ent material, known as a third body, between 
the two rubbing objects. Because this process 
relies on the specific ways in which the surfaces 
of rubbing objects wear away, and often even 
on friction-induced chemistry, the formation 
of a third body is largely system-specific. In 
other words, few materials can form a visco- 
elastic, liquid-solid third body in response 
to friction and wear, such as that observed in 
Canale and colleagues’ experiments. 

Interesting questions remain. For example, 
does the observed layer form rapidly enough 
on pristine ice to lubricate on contact? And 
how does the presence of the layer explain 
that the optimal ice temperature for skating** 
is about —7 °C? (The temperature at which all 
speed-skating rinks are kept.) Beyond ice 
skating, detailed measurements of the prop- 
erties of thin lubrication films are likely to be 
useful, because many such films do not show 
the same properties as the bulk lubricant®, and 
because the origin and effects of the third body 
inlubrication remaina largely open question. 
These are important issues because lubrication 
is often the only way to reduce friction, whichis 
responsible for an estimated 20% of the world’s 
energy consumption’. 
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Neuroscience 


Internal brain 


States in motion 


Ethan Scott 


The brain fluctuates between different internal states, each 

of which drives particular behaviours. Brain-wide imaging 
reveals the internal states that help zebrafish larvae to choose 
between exploring and hunting. See p.239 


Internal brain states modulate how we 
perceive, feel about and respond to the stim- 
uli around us. The neurons and networks that 
encode these various states remain mysteri- 
ous, owing to the brain’s enormous complex- 
ity. As aresult, researchers have only a vague 
sense of how animals, including humans, 
use recent experience and context to select 
appropriate behaviours. On page 239, Marques 
et al. perform a first-of-its-kind exploration 
of brain-wide activity in zebrafish larvae as 
the animals encounter a classic state-driven 
behavioural choice: whether to explore their 
environment or to exploit resources in the 
local area. By monitoring individual neurons 
across the brain as these decisions play out, 
the authors open a window onto the neural 
underpinnings of internal states and how they 
can influence behaviour. 

Advanced microscopy is revolutionizing 
the ways in which we observe the functioning 
brain. Historically, neural activity has been 
monitored either using brain-wide methods 
that cannot resolve individual neurons, or 
with electrophysiology, which records indi- 
vidual neurons or small localized populations. 
Neither approach can address activity across 
vast assemblies of individual neurons — which 
is exactly what our brains are. 

Genetically encoded calcium indicators 
(GECIs) offer a possible solution to this 
problem. These proteins provide a pulse 
of fluorescence when a neuron is active. 
When expressed throughout the brain and 
combined with the right microscope, they pro- 
duce a blinking light field that can reveal the 
activity of thousands of individual neurons 
simultaneously. 

Zebrafish larvae are small and transparent, 
making it possible to image GECIs across their 
entire brains at cellular resolution’*. How- 
ever, microscopy techniques have typically 
required that the animals are stationary, either 
paralysed or with their heads embedded in 
agarose, hampering our ability to analyse 
how sensory processing relates to natural 
behaviour. 


Brain-wide GECI imaging during free- 
swimming behaviour became possible when 
the group behind the current study intro- 
duced a tracking fluorescence microscope’. 


“Theauthors have delivered 
asatisfying scheme for how 
exploration and exploitation 
behaviours are controlled by 
internal brain states.” 


This machine constantly monitors and pre- 
dicts the movements of a free-swimming 
larva, and moves the imaging platform to 
compensate for the animal’s movement, 
keeping the brain in the microscope’s field of 
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view. Meanwhile, it uses brief pulses of light 
to record fluorescence signals from GECIs 
in the brain. After spatial alignment of the 
images across the experiment, the approach 
produces brain-wide, cell-level data on neural 
activity, complemented by behavioural infor- 
mation on the animal’s position, speed and 
movements. 

In the current study, Marques and col- 
leagues used this platform to explore the 
neural underpinnings of two larval behav- 
iours that involve free swimming: explora- 
tion (efficient swimming over distance) and 
exploitation (localized predation). Explora- 
tion and exploitation place opposing pulls 
on diverse animals>* — the former offers the 
possibility of resource discovery, but at a 
metabolic cost, and the latter is an effective 
approach for predatory feeding, but only if 
prey are present. These behavioural modes are 
influenced by external inputs and hunger level, 
but also alternate spontaneously, suggesting 
that internal brain states have a role in which 
behaviours are chosen. 

The authors recorded videos of free- 
swimming larvae, and categorized each of the 
animals’ movements to generate thorough 
descriptions of behaviours during exploration 
and exploitation phases. They showed that 
movements such as forward swimming and 
routine turns tend to occur at a similar time 
and indicate that the larvais in an exploratory 
phase. Other movements, including targeting 
turns and predatory strikes typical of hunting, 
identify exploitative phases. 

Marques et al. next performed GECI 


b Habenulae 


Figure 1 | Mapping exploration and exploitation. a, Using whole-brain imaging of neuronal activity 

in free-swimming zebrafish larvae, Marques et al.‘ have identified distinct categories of neuron that are 
active during phases of larval behaviour involving exploration (blue larva) and exploitation (hunting; red). 
As this cyclical graph shows, a network of neurons (blue) is steadily active throughout exploration. This 
activity is replaced by a peak ina ‘trigger network’ of neurons (yellow), which then drives the activity of 
exploitation-network neurons (red). Exploitation-network activity tails off over time, reaching a baseline as 
the exploration network becomes active once more and the cycle begins again. b, The exploration network 
is distributed throughout pre-motor regions of the zebrafish hindbrain. The exploitation network is found 
mainly in the dorsal raphe. The trigger network is also found in this region, and in the habenulae. 
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imaging to analyse which brain networks 
encode the corresponding internal states, 
and what changes occur during the transitions 
between phases of behaviour. They extracted 
activity data from tens of thousands of indi- 
vidual neurons per animal, seeking neurons 
with activity that correlated with specific 
swimming manoeuvres, with exploration 
or exploitation phases, or with transitions 
between exploration and exploitation. This 
analysis revealed three notable categories of 
neuron (Fig. 1). 

First, there are neurons dispersed across 
the brain that are active during exploration. 
Because they are broadly distributed and their 
activity does not show obvious peaks, they 
are probably involved in executing explor- 
atory manoeuvres, rather than encoding or 
triggering the state itself. 

Second, there is aneuronal population that 
is present in high numbers inastructure called 
the dorsal raphe. These neurons are power- 
fully active at the beginning of an exploitation 
phase. Their activity gradually weakens over 
time, settling to a baseline during the phase 
transition back to exploration. 

Third, there is a ‘trigger network’ of neurons, 
located in multiple localized regions across 
the brain. The activity of these neurons peaks 
sharply when the larva is transitioning from 
exploration to exploitation. The strength of 
the peak from the trigger network predicts 
the strength of the subsequent activation in 
the dorsal raphe’s exploitation-network neu- 
rons. This, inturn, predicts the duration of the 
ensuing exploitation phase. 

How does this arrangement help to deliver 
appropriate behaviours at appropriate times? 
The expectation is that exploration and 
exploitation phases are influenced both by 
hunger and by the local availability of prey. A 
hungry larva should stay where it is and exploit 
prey when they are present, and should look 
for prey if they are absent locally. In agree- 
ment with this theory, the authors found that 
exposing larvae to light after a period of dark — 
revealingthe presence oflocal preytothelarvae, 
whose predation is largely visual — caused 
increased trigger-network activity, activation 
of exploitation neurons inthe dorsal raphe, and 
predation. By contrast, however, they found 
only a weak trend towards exploitation in 
hungry animals presented with prey. These 
mixed results provide a strong incentive to 
study the combinations of environmental stim- 
uliand internal cues that spur trigger-network 
activity, and therefore feeding. 

Insummary, Marques et al. have delivered 
a plausible and satisfying scheme for how 
exploration and exploitation behaviours are 
controlled by internal brain states. Rather than 
being a conclusion, this is a departure point 
for further exploration of the underlying net- 
works. Although whole-brain GECI imaging is 
a powerful tool, it provides little information 
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on neuronal connectivity, and reveals only 
correlations between neurons activity rather 
than the causal relationships among them. 
Going forward, ablations or light-based 
optogenetic manipulations of targeted 
neurons could help to reveal exactly how 
activity patterns or network-level properties 
drive transitions between exploration and 
exploitation states. Furthermore, mapping 
out neurons’ functional properties against 
their physical architecture (as described ina 
recent atlas of cell morphologies in zebrafish’) 
or connectivity (as inferred from brain-wide 
electron-microscopy data’) would provide an 
anatomical framework for the flow of informa- 
tionthrough the system. Such functional and 
anatomical information will allow researchers 
both to test Marques and colleagues’ model of 
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internal states, and to identify the fine details 
of how exploration, exploitation and trigger 
networks interact to control behaviour. 
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Notall fast radio bursts 
are created equal 


Matthew Bailes 


Astronomical signals called fast radio bursts remain enigmatic, 
but a key discovery has now been made. A second repeating 
fast radio burst has been traced to its host galaxy, and its home 
bears little resemblance to that of the first. See p.190 


Millisecond-duration, extragalactic radio 
flashes called fast radio bursts!” (FRBs) pres- 
ent many puzzles that are strikingly similar 
to past mysteries concerning phenomena 
now known as quasars and y-ray bursts. Like 
FRBs, these phenomena seemed to be uni- 
formly distributed in the sky, but of almost 
impossible luminosity if cosmological in ori- 
gin. Ultimately, it was revealed that quasars are 
associated with supermassive black holes, and 
y-ray bursts with the formation of black holes 
after supernovae (stellar explosions) or with 
mergers of stellar remnants called neutron 
stars. Repeating FRBs provide the opportunity 
to identify the homes and possible progenitors 
of FRB sources. The first known repeater was 
localized to a star-forming region ina tiny, 
metal-poor galaxy’. On page 190, Marcote 
etal.‘ pinpoint the home of asecond repeater 
and find it to be very different. 

As an FRB travels through the ionized 
intergalactic medium, its radio waves inter- 
act with free electrons and are slowed. 
High-frequency waves are less affected than 
low-frequency ones, so the former arrive at an 
FRB detector slightly sooner than the latter. 
This time difference is proportional to the total 
number of electrons between the FRB source 
and the detector, and can therefore be used to 
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estimate the enormous distance to the source. 
Early FRB detectors could not unambiguously 
identify an FRB host galaxy because their 
localization resolutions spanned hundreds 
or even thousands of potential host galaxies. 
But now, multi-element radio telescopes such 
as the Australian Square Kilometre Array Path- 
finder (ASKAP) and the Deep Synoptic Array 
ten-antenna prototype (DSA-10) in California 


“The host galaxy is about 
200 times farther from 
Earththanis our nearest 
neighbour, the Andromeda 
galaxy.’ 


have high enough resolution that host galaxies 
can be easily identified. 

At the same time, a radio telescope known 
as the Canadian Hydrogen Intensity Map- 
ping Experiment? (CHIME) is revealing that 
many FRBs repeat. The individual bursts of a 
repeater tend to be clustered in time, which 
means that the probability of catching a burst 
is higher when a repeater is active. Because 
CHIME surveys the sky daily, it is in an ideal 
position to trigger other telescopes to carry 
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Not all fast radio bursts 
are created equal 


Matthew Bailes 


Astronomical signals called fast radio bursts remain enigmatic, 
but a key discovery has now been made. A second repeating 
fast radio burst has been traced to its host galaxy, and its home 
bears little resemblance to that of the first. 


Millisecond-duration, extragalactic radio 
flashes called fast radio bursts’? (FRBs) pres- 
ent many puzzles that are strikingly similar 
to past mysteries concerning phenomena 
now known as quasars and y-ray bursts. Like 
FRBs, these phenomena seemed to be uni- 
formly distributed in the sky, but of almost 
impossible luminosity if cosmological in ori- 
gin. Ultimately, it was revealed that quasars are 
associated with supermassive black holes, and 
y-ray bursts with the formation of black holes 
after supernovae (stellar explosions) or with 
mergers of stellar remnants called neutron 
stars. Repeating FRBs provide the opportunity 
toidentify the homes and possible progenitors 
of FRB sources. The first known repeater was 
localized to a star-forming region ina tiny, 
metal-poor galaxy’. Writing in Nature, Marcote 
etal.* pinpoint the home of asecond repeater 
and find it to be very different. 

As an FRB travels through the ionized 
intergalactic medium, its radio waves inter- 
act with free electrons and are slowed. 
High-frequency waves are less affected than 
low-frequency ones, so the former arrive at an 
FRB detector slightly sooner than the latter. 
This time difference is proportional to the total 
number of electrons between the FRB source 
and the detector, and can therefore be used to 
estimate the enormous distance to the source. 
Early FRB detectors could not unambiguously 
identify an FRB host galaxy because their 
localization resolutions spanned hundreds 
or even thousands of potential host galaxies. 
But now, multi-element radio telescopes such 
as the Australian Square Kilometre Array Path- 
finder (ASKAP) and the Deep Synoptic Array 
ten-antenna prototype (DSA-10) in California 
have high enough resolution that host galaxies 
can be easily identified. 

At the same time, a radio telescope known 


as the Canadian Hydrogen Intensity Mapping 
Experiment? (CHIME) is revealing that many 
FRBs repeat. The individual bursts of a 
repeater tend to be clustered in time, which 
means that the probability of catching a burst 
is higher when a repeater is active. Because 
CHIME surveys the sky daily, it is in an ideal 
position to trigger other telescopes to carry 
out localizations. Marcote and colleagues 
caught four bursts of one of the repeat- 
ers discovered by CHIME and used these 
observations to determine the precise location 
of the repeater in a nearby galaxy (Fig. 1). This 


galaxy is about 200 times farther from Earth 
than is our nearest neighbour, the Andromeda 
galaxy. 

The first known repeater differed in at 
least three ways from many of the one-off 
FRBs. First, it showed peculiar changes in 
intensity as a function of frequency and time, 
akin to intense solar radio emissions called 
solar radio bursts. Second, it traversed a 
highly magnetized plasma that led to a twist 
inits polarization as a function of frequency. 
And third, it was emitted near a super- 
massive black hole. Although one-off FRBs 
are harder to study than are repeaters, they 
are known to sometimes have complicated, 
multi-component temporal profiles®, simi- 
lar to the emissions from magnetars (highly 
magnetized neutron stars). 

Could it be that one-off FRBs and repeaters 
are like short- and long-duration y-ray bursts 
— two different classes of phenomenon that 
just happen to have a similar emission time- 
scale and luminosity? Or is there a theory of 
FRBs that would link the two classes? Most 
FRB theories involve a compact astronomi- 
cal object, and emissions that are associated 
with extremely powerful magnetic fields or 
rapidly spinning (and therefore energetic) 
neutron stars. Some scientists, desperate 
for an explanation, have even proposed alien 
transmissions’. 


Figure 1 | Localization of a repeating fast radio burst. Fast radio bursts (FRBs) are bright, extragalactic 
radio flashes of unknown physical origin. Marcote et al.* have determined the precise location of a source 
of repeating FRBs (indicated by the red circle). Unlike the previous localization of a repeating FRB’, this 
source was found ina fairly ordinary, massive spiral galaxy. The image is taken using optical light, anda 
higher-contrast zoom-in of the relevant region of the galaxy is included. (Image from ref. 4.) 
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Itis possible to envisage a unifying model 
in which FRB sources have an energy reservoir 
(whether magnetic, kinetic or gravitational) 
that is plentiful when the source is produced 
near a star-forming region, causing frequent 
bursts. As this reservoir becomes depleted, the 
bursts are separated by increasingly long gaps, 
until they cease altogether. Such behaviour is 
similar to that of ‘glitches’ (sudden changesin 
spin frequency) in atype of neutron star called 
apulsar. Young pulsars often glitch prolifically, 
but older pulsars can go for decades without 
exhibiting a glitch. Frequently repeating FRBs 
would be easier to find than those that have 
a lower repetition rate, but verifying that 
they are repeaters would become increas- 
ingly difficult as their repetition timescales 
approached the lifetime of FRB experiments 
(currently years). 

The first known repeater seemed to fit 
such a model. It could have been a young 
magnetar emitting from a relatively exotic 
location — near a supermassive black hole 
in the star-forming region of a small, met- 
al-poor galaxy. Such an environment is very 
different from that of most young neutron 
stars in our own Galaxy. However, Marcote 
and colleagues found that the source of 
their repeater is not particularly magnetic, 
is nowhere near a supermassive black hole 
andis ina fairly normal galaxy. Even so, the 
source is close to a curious, V-shaped region 
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of stars that only adds to the mystery of FRB 
origins (Fig. 1). 

Inthe past year, ASKAP has localized several 
one-off FRBs®’ and found them to inhabit a 
wide range of galaxy types. The beauty of the 
ASKAP results and those of Marcote etal. liesin 
theaccuracy and reliability of the localizations. 
These features are not available to relatively 
compact telescopes”, such as DSA-10, which 
can only hint at the identity of the relevant 
galaxy (think of this as the FRB’s home city), 
rather than pinpointing the exact environment 
within it (the FRB’s home address). 

Ultimately, we will discover that, for each 
class of FRB, either these signals are produced 
near active sites of star formation or there is 
no particular correlation. If they originate 
near the sites of supernovae, the FRBs prob- 
ably arise from young, highly magnetic neu- 
tron stars. If their origins are farther from 
such sites, they might be produced by time 
bombs of gravitational energy (such as merg- 
ing neutron stars) and have diverse spatial 
distributions similar to those of short-duration 
y-ray bursts, which are now known to be associ- 
ated with neutron-star coalescence. Curiously, 
because the luminosity of a typical quasar is 
not dissimilar to the peak luminosity of an 
FRB, we cannot yet completely discount the 
involvement of supermassive black holes. But 
Marcote and colleagues’ results argue against 
that possibility, and the timescales for quasar 
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variability are more likely to be days to months 
than milliseconds. 

Rates of FRB discovery and host-galaxy 
identification are likely to greatly increase 
as more detectors come online. Results from 
CHIME are widely anticipated, and facilities 
suchas the MeerKAT radio telescope in South 
Africa and the Five-hundred-meter Aperture 
Spherical radio Telescope (FAST) in China 
will soon come online. Do not be surprised if 
there is more than one way to generate FRBs — 
without the need for aliens. 
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A primary goal of human genetics is to identify DNA sequence variants that influence 
biomedical traits, particularly those related to the onset and progression of human 
disease. Over the past 25 years, progress in realizing this objective has been 
transformed by advances in technology, foundational genomic resources and 
analytical tools, and by access to vast amounts of genotype and phenotype data. 
Genetic discoveries have substantially improved our understanding of the 
mechanisms responsible for many rare and common diseases and driven 
development of novel preventative and therapeutic strategies. Medical innovation 
will increasingly focus on delivering care tailored to individual patterns of genetic 


predisposition. 
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For almost all human diseases, individual susceptibility is, to some 
degree, influenced by genetic variation. Consequently, characterizing 
the relationship between sequence variation and disease predisposi- 
tion provides a powerful tool for identifying processes fundamental 
to disease pathogenesis and highlighting novel strategies for preven- 
tion and treatment. 

Over the past 25 years, advances in technology and analytical 
approaches, often building on major community projects—such as 
those that generated the human genome sequence! and elaborated 
on that reference to capture sites of genetic variation” °—have enabled 
many of the genes and variants that are causal for rare diseases to be 
identified and enabled a systematic dissection of the genetic basis of 
common multifactorial traits. There is growing momentum behind 
the application of this knowledge to drive innovation in clinical care, 
most obviously through developments in precision medicine. Genomic 


medicine, which was previously restricted to a few specific clinical 
indications, is poised to go mainstream. 

This Review charts recent milestones in the history of human disease 
genetics and provides an opportunity to reflect on lessons learned by 
the human genetics community. We focus first on the long-standing 
division between genetic discovery efforts targeting rare variants 
with large effects and those seeking alleles that influence predispo- 
sition to common diseases. We describe how this division, with its 
echoes of the century-old debate between Mendelian and biometric 
views of human genetics, has obscured the continuous spectrum of 
disease risk alleles—across the range of frequencies and effect sizes— 
observed in the population, and outline how genome-wide analyses 
in large biobanks are transforming genetic research by enabling a 
comprehensive perspective on genotype-phenotype relationships. 
We describe how the expansion in the scale and scope of strategies 
for enumerating the functional consequences of genetic variation 
is transforming the torrent of genetic discoveries from the past dec- 
ade into mechanistic insights, and the ways in which this knowledge 
increasingly underpins advances in clinical care. Finally, we reflect on 
some of the challenges and opportunities that confront the field, and 
the principles that will, over the coming decade, drive the application 
of human genetics to enhance understanding of health and disease 
and maximize clinical benefit. 
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Rare (monogenic) disease 


Genes harbouring variants causal for monogenic disease 
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Fig.1| Growthin the discovery of disease-associated genetic variation. The 
cumulative numbers of genes harbouring variants causal for rare, monogenic 
diseases and traits and of significant GWAS associations implicated in 
common, complex diseases and traits are shown. Left, the advent of high- 
throughput sequencing technologies and availability of reference genomes 
from diverse populations has supported a fourfold increase in the discovery of 
rare disease-causing genes between 1999 and 2019. Right, international efforts 
suchas the Human Genome Project and the HapMap Project, combined with 


Rare diseases, rare variants 


During the 1980s and 1990s, efforts to map disease genes were focused 
on rare, monogenic and syndromic diseases and were mostly driven 
by linkage analysis and fine mapping within large multiplex pedi- 
grees. Localization of genetic signals was typically followed by Sanger 
sequencing of the genes found to map within the linked locus to iden- 
tify disease-causing alleles. Assessments of pathogenicity, based on 
segregation of a putatively causal variant with disease across multiple 
families and evidence that the risk genotype was absent in healthy 
individuals, were typically followed by confirmatory functional stud- 
ies in cellular and animal models. This path to gene identification was 
laborious; nevertheless, by 2000, around 1,000 of the estimated 7,000 
single-gene inherited diseases had been characterized, including many 
with substantial biomedical impact, such as Huntington’s disease and 
cystic fibrosis’ °. 
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Common (complex) disease 


Genome-wide significant association signals 
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sequence data for complex traits’ 2016 
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GWAS and sequencing studies, have supported identification of more than 
60,000 genetic associations across thousands of human diseases and traits. 
Centre, more recent developments have brought a synthesis of the rare- and 
common-variant approaches based around the combination of sequence- 
informed analyses in large cohorts. Key events contributing to these themes 
are depicted in the timeline. GA4GH, Global Alliance for Genomics and 
Health!°; ExAC, Exome Aggregation Consortium®. 


Completion of the draft human genome sequence’ reduced many 
of the obstacles to disease-gene mapping and propelled a fourfold 
increase in the genes implicated as causal for rare, single-gene dis- 
orders (Fig. 1). Microarray-based detection of structural variation” 
and exome- and genome-wide sequencing”’” have been pivotal, bol- 
stered by in silico analysis and prioritization of the discovered genetic 
variants. Increasing availability of reference datasets cataloguing 
population genetic variation across diverse ethnic backgrounds has 
supported robust causal inference”*>*. More recently, the adoption of 
high-throughput sequencing technologies has enabled the full range 
of causal genetic variation, from single mutations to large structural 
rearrangements, to be identified ina single assay. These technologies 
have extended from research into clinical usage, driving earlier and 
faster diagnosis for genetic disorders. 

Reduced reliance on multiplex pedigrees in favour of collections of 
affected cases, often with parents”, has proven decisive in identifying 


new dominant disorders, many of which were previously considered 
recessive”. Increasingly, discovery of rare disease genes has transi- 
tioned from genetic characterization of small numbers of individu- 
als with similar clinical presentations to genome-wide sequencing of 
larger cohorts of phenotypically diverse patients. This genotype-driven 
approach has revealed new disorders associated with more variable 
clinical presentation”**. 

Amore systematic approach to data sharing has been critical, both 
for the characterization of new disorders and diagnostic interpretation 
of potential causal alleles. The value of sharing genetic and phenotypic 
data from those thought to harbour rare undiagnosed genetic diseases 
has fostered global collaborative networks (for example, Matchmaker 
Exchange, DECIPHER and GeneMatcher) designed to match patients 
with similar genetic variants and/or phenotypic manifestations, even 
across continents”. Interactions between researchers and families 
with rare disease have enabled natural history studies to be driven 
by family support groups positioned to initiate data collection from 
patient cohorts once a causal gene is discovered”°. 

Clinical translation of these technologies has benefited froma series 
of information resources, including open databases of genes associated 
with rare disorders (for example, OMIM and ORPHANET)”, clinically 
interpreted variants (for example, ClinVar and ClinGen)”*” and patient 
records (for example, DECIPHER and MyGene2 (https://mygene2.org/ 
MyGene2))”. Access to resources that catalogue genetic variation 
across populations (such as ExAC and its successor gnomAD)** has 
enabled the confident exclusion of genetic variants too common in 
population-level data to be plausible causes of rare, penetrant early- 
onset genetic diseases”. These analyses have reduced the contamina- 
tion of databases with variants erroneously interpreted as causal for 
disease, and are addressing the overestimation of disease penetrance 
arising from the historical focus on multiplex pedigrees”. Improved 
recognition of the variable penetrance of many ‘monogenic’ disease 
alleles has invigorated efforts to identify the genetic and environmental 
modifiers responsible”*”’. 

Although huge strides have been made in associating specific genes 
with particular disorders, establishing the causal role of individual vari- 
ants within those genes remains problematic, and many patients with 
suspected rare genetic diseases are left without a definitive diagnosis”®. 
Even for variants with established causality, the penetrance is often 
unclear. Resolving these uncertainties represents the central chal- 
lenge for the field. Aggregation of sequencing data from large numbers 
of affected cases and population reference samples will provide the 
evidence base required for robust interpretation of variants. Highly 
parallelized in vitro cellular assays that allow assessment of the func- 
tional effects of all variants in a disease-associated gene can transform 
interpretation of novel variants”, although developing well-calibrated 
functional assays predictive of pathogenicity for all disease genes rep- 
resents a daunting prospect. Direct functional genomic exploration 
of accessible and disease-relevant tissues from patients using RNA 
sequencing and DNA methylation assays" can identify previously 
cryptic causal genetic variants, particularly in under-explored regions 
outside protein-coding genes’. Developments in each of these areas 
will extend the range of variants and genes for which diagnostic and 
prognostic clinical information can be provided to patients and their 
families. 


Common diseases, common variants 


Efforts to apply the approach—linkage analysis in multiplex pedi- 
grees—that had been so successful for the high-penetrance variants 
responsible for Mendelian disease were, with notable exceptions** *°, 
largely unsuccessful for common, later-onset traits with more complex 
multifactorial aetiologies, such as asthma, diabetes and depression. 
Recognition that association-based methods, focused on detecting 
phenotype-related differences in variant allele frequencies might have 


greater traction for identifying less penetrant common alleles redi- 
rected attention to analysis of case-control samples”. However, initial 
efforts targeting variants within ‘candidate’ genes were plagued by 
inadequate power, unduly liberal thresholds for declaring significance 
and scant attention to sources of bias and confounding, resulting in 
overblown claims and failed replication. 

Systematic efforts to characterize genome-wide patterns of 
genomic variation, initially through the HapMap Consortium’, 
proved catalytic, demonstrating that the allelic structure of the 
genome was segmented into haplotype blocks, each containing sets 
of correlated variants. Recognition that this configuration could 
support genome-wide surveys of association energized the tech- 
nological innovation—in the form of massively parallel genotyping 
arrays—to make such studies possible (Fig. 1). Early wins in acute 
macular degeneration’® and inflammatory bowel disease” were 
encouraging, and progress on several fronts—expansion of study 
size, denser genotyping arrays, novel strategies for imputation, 
attention to biases and appropriate significance thresholds—deliv- 
ered robust associations across a range of diseases*°. Most variants 
uncovered by these early genome-wide association studies (GWAS) 
were common, with more subtle effects than many had anticipated. 
Ahost of trait-specific consortia formed, covering diverse dichoto- 
mous and quantitative phenotypes, to accelerate genetic discovery 
through the aggregation and meta-analysis of data from multiple 
GWAS studies* **. Many tens of thousands of robust associations 
were identified**. Recently, increased access to exome and whole- 
genome sequence data has, through both direct association analy- 
sis*° and imputation**, extended discovery to low-frequency and 
rare alleles previously inaccessible to GWAS. 

In the decade since the first GWAS, understanding of the genetic 
basis of common human disease has been transformed. The disparity 
between the observed effects of the variants first identified by GWAS 
and estimates of overall trait heritability (the ‘missing heritability’ 
conundrum) is now largely resolved*”. Common diseases are not simply 
aggregations of related Mendelian conditions: for most complex traits, 
genetic predisposition is shared across thousands of mostly common 
variants with individually modest effects on population risk". 

Although the collective contribution of low-frequency and rare risk 
alleles to overall trait variability appears modest compared with that 
attributable to common variants*’, the rare risk alleles detected in 
current sample sizes necessarily have large phenotypic effects and 
are proportionately more likely to be coding, enhancing their value 
for biological inference. Founder populations (such as those from 
Finland and Iceland) have provided multiple examples of otherwise 
rare risk alleles driven to higher frequency locally through drift and/ 
or selection” ©. In addition, studies in populations with high rates of 
consanguinity make it possible to identify individuals homozygous for 
otherwise rare loss-of-function alleles, the basis for a ‘human knockout’ 
project to systematically investigate the phenotypic consequences of 
gene disruption in humans”. 

For most diseases, large-scale GWAS-aggregation efforts have been 
disproportionately powered by information from individuals of Euro- 
pean descent®. Whereas patterns of genetic predisposition appear 
broadly similar across major population groups and many common 
risk alleles discovered in one population group are detectable in oth- 
ers, allele frequencies can vary substantially; extending GWAS and 
sequencing studies to diverse populations will surely generate a rich 
harvest of novel risk alleles. 

The relative contributions of common and rare variants indicate 
that, for many traits, particularly those with post-reproductive onset, 
purifying selection has had only limited effect*>**. For a few risk alleles, 
hallmarks of balancing selection reflect increased carrier survival, 
usually through protection from infectious diseases. This includes 
well-known examples of alleles maintained at high frequency in popula- 
tions of African descent’. 
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While the extensive linkage disequilibrium within human populations 
has been essential to discovery in GWAS, high correlation between 
adjacent variants frustrates mapping of the specific variants respon- 
sible for these associations. Increasing sample size, improved access 
to trans-ethnic data, and more representative imputation reference 
panels? provide a path to improved resolution of the causal variants” 
and clues to the molecular mechanisms through which they operate. 
Functional interpretation is easiest for causal variants within cod- 
ing sequences; however, most common disease-risk variants map to 
noncoding sequences, and are presumed to influence predisposition 
through effects on transcriptional regulation. In these cases, mechanis- 
tic inference depends on connecting association signals to their down- 
stream targets (see below). For many traits, there is clear convergence 
between common-variant association signals and genes implicated in 
monogenic forms of the same disease, as well as enrichment of GWAS 
signals in regulatory elements specifically active in cell types consistent 
with known disease biology. This provides reassurance that, even 
as the number of association signals for a given disease proliferates, 
the genetic associations uncovered will coalesce around molecular and 
cellular processes with a core role in pathogenesis”. 

Importantly, the signals discovered by GWAS have revealed many 
unexpected insights into the biological basis of complex disease. 
Examples include the role of complement in the pathogenesis of 
acute macular degeneration’, synaptic pruning in schizophrenia“ and 
autophagy in inflammatory bowel disease®. In addition, as inherited 
sequence variation is a prominent cause of phenotypic variation (but 
the reverse is not true), risk variants identified by GWAS have value as 
genetic instruments, mapping causal relationships between traits and 
inferring contributions made by circulating biomarkers and environ- 
mental exposures to disease development. 

As described below, findings from GWAS have increasing transla- 
tional impact through identification of novel therapeutic targets”, 
prioritization (and deprioritization) of existing ones and development 
of polygenic scores that quantify individual genetic risk”. 


Comprehensive genotype-phenotype maps 


The historical division of disease-gene discovery into monogenic and 
polygenic strands arose from development and implementation of 
analytical approaches—family-based linkage and case-control associa- 
tion*’”—that are best-suited for detecting particular subsets of causal 
alleles. This obscured the true state of nature, with disease-risk alleles 
being distributed across a continuous spectrum of frequencies and 
effect sizes. In addition, the trait- and disease-specific perspective of 
early GWAS discovery (mostly reliant on case-control studies) was 
poorly equipped to investigate the contribution of genetic variants 
to phenotypic effects that are nested within or spread across classical 
disease definitions. Recent developments have enabled a more holistic 
perspective on genotype-phenotype relationships (Fig. 1). 

One major advance has been the increasing availability of large pro- 
spective population-based cohorts. These biobank efforts, pioneered 
in studies suchas the Framingham Cohort” and the efforts of DeCODE 
inIceland”’””, now encompass a growing inventory of national cohorts 
in North America, Europe, Asia and beyond”. The UK Biobank study, 
including 500,000 largely healthy, middle-aged participants has been 
particularly influential, transforming human genetic research in part 
through permissive data-sharing policies that have allowed multiple 
research groups to analyse the data”. Efforts to make clinical data 
embedded in electronic health records and registries available for 
research””’”’ mean that biobanks increasingly provide access toa wide 
range of demographic, clinical and lifestyle data, captured in harmo- 
nized, systematic fashion from large, often multi-ethnic collections 
of individuals. For millions of biobank participants, this rich pheno- 
typic information has been combined with genome-wide genetic data. 
There are nascent efforts to capture transcriptomic, proteomic and 
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metabolomic phenotypes, although these are not yet at equivalent 
scale to the genetic data’”*°. Biobank analyses have provided more 
generalizable estimates of the relevance of genetic risk factors in 
the context of the separate and joint effects of non-genetic factors®. 
Increasingly, integration with healthcare data brings a longitudinal 
dimension to phenotypic characterization, which facilitates analyses 
of disease progression and lifelong disease risk®. 

The rich phenotypic scope of these cohorts has enabled variants of 
interest to be interrogated for associations across the gamut of avail- 
able phenotypes. These phenome-wide association studies (PheWAS) 
have revealed the extent to which many variants have pleiotropic effects 
across multiple traits®. Some of these relationships are expected, such 
as the impact of obesity variants on risk of hepatic steatosis andtype2 
diabetes®™ or variants that influence multiple autoimmune conditions®. 
Others connect diseases and traits in surprising ways, highlighting 
shared polygenic, pleiotropic effects and cell-type specificity, and deliv- 
ering insights into shared biology and overlapping mechanisms®***”, 
These findings inform the prioritization of therapeutic targets, pro- 
viding clues to potential on-target side effects and opportunities for 
drug repurposing® ®*. 

The second enabler of inclusive, systematic analysis of genotype- 
phenotype relationships has been access to whole-genome sequence 
data. The scale of genetic analysis based on sequence data still lags 
behind that of genome-wide genotyping data (the largest sequence- 
based datasets are one tenth the size of the largest GWAS” *”), although 
reductions in sequencing costs are decreasing the differential. Most 
direct analysis of high-throughput sequence data has focused on the 
coding regions. Strategies for assigning variant function and jointly 
analysing sets of variants of similar functional effect have enabled 
aggregate, gene-level tests of rare functional-variant association that 
are often better powered than single-variant tests”’**. However, the 
principal benefit to date of whole-genome sequence data to genetic 
discovery has been to bolster array-based access to lower-frequency 
alleles, either directly, through their inclusion on genotyping plat- 
forms, or indirectly, through imputation from sequence-based refer- 
ence samples**. 

These developments have enabled researchers to bridge the gap 
between the monogenic and polygenic realms, identifying common 
variant modifiers of monogenic phenotypes contributing to the vari- 
able expression of rare, large-effect alleles”*”, and low-frequency and 
rare variants that influence common multifactorial traits. This ena- 
bles more rigorous evaluation of the contribution of rareand common 
variants to trait susceptibility** and supports the enumeration of ‘allelic 
series’ (sets of alleles of varying frequency, effect size and direction that 
disrupt the same gene) critical for studies of disease mechanism and 
therapeutic target optimization®**. These developments are rapidly 
converging towards the ultimate destination: a comprehensive matrix 
of the effect of all observable genetic variants across the widest pos- 
sible range of cross-sectional and longitudinal biomedical phenotypes. 
Success in this endeavour depends on ever greater harmonization 
between, and integration of results from, individual studies through 
sustained investments in data sharing. 


Adding function 


From the first linkage maps to whole-genome sequencing of large 
cohorts, human genetics has deployed increasingly sophisticated and 
inherently systematic approaches for mapping the genetic factors that 
underlie traits and diseases. However, progress in determining how 
these variants influence disease, through systematic interrogation 
of their functional effects on molecular, cellular and physiological 
processes, has been far slower. 

For monogenic diseases, for which the alleles responsible are typi- 
cally rare, penetrant and coding, genetic approaches have generally 
been both necessary and sufficient to implicate a gene as causal’”®, 


However, as efforts to elucidate the genetic basis of Mendelian disor- 
ders progress towards completion”, functional studies remain impor- 
tant to understand the mechanisms by which disruptive variation within 
a causal gene leads to disease phenotypes. Unlike common diseases, 
the clarity of causation for Mendelian disorders usually simplifies the 
task of generating models (including human cells and organoids or 
rodents) to connect genotype to organismal phenotype; these have 
led to many critical insights into the biology of health and disease in 
humans””?. In addition, for genes harbouring variants with medically 
actionable consequences (as with the BRCAI and BRCA2 mutations that 
are causal for early-onset breast and ovarian cancer), functional studies 
can support the translational interpretation of novel alleles identified 
by medical sequencing”. 

For common diseases, functional studies have a more fundamental 
role. Although tens of thousands of associations have been discovered 
across thousands of common human diseases and traits“*, multiple 
factors have frustrated efforts to convert these genetic signals to knowl- 
edge about causal variants, genes and mechanisms. For the common 
variants that underlie the bulk of complex-disease risk, the resolution 
of association mapping is often limited by the haplotype structure of 
the human genome? *. Furthermore, most GWAS associations map to 
the noncoding genome and thus lack a direct address to the gene that 
mediates their effects. Growing appreciation of the pervasive role of 
pleiotropy complicates matters: many variants identified by GWAS 
are associated with multiple traits and exert diverse effects across 
multiple cell types’. 

To date, relatively few studies have achieved the goal of connecting 
variants causal for complex traits to the molecular and cellular func- 
tions that mediate that predisposition. One early success described 
how regulatory variants that modulate SORTI expression influence 
low-density lipoprotein cholesterol and myocardial infarction risk’. 
More recent examples have focused on the relationship between obe- 
sity-associated variants intronic to F7O, altered expression of /RX3 and 
IRXS5, and adipocyte’™ and hypothalamic'™ function. Similar functional 
descriptions have been reported for individual lociimplicated in schizo- 
phrenia®, cardiovascular disease’, type 2 diabetes’ and Alzheimer’s 
disease’”°, among others. 

Over the past decade, the challenge for the functional genomics 
community has been to convert this ‘one-locus-at-a-time’ workflow to 
asystematic, multidimensional, integrative approach able to deliver 
genome-scale functional analyses to match genome-wide variant discov- 
ery (Fig. 2). Atthe molecular level, one cornerstone has been generation of 
genome-wide catalogues of functional activity. For example, the ENCODE 
and Roadmap Epigenomics projects have generated maps of histone 
modifications, transcription-factor binding, chromatin accessibility, 
three-dimensional genome structure and other regulatory annotations 
across hundreds of cell types and tissues °°. The patterns of genomic 
overlap between these data and GWAS results enable the functional infer- 
ence of risk variants, deliver clues tothe specific cell types driving disease 
pathogenesis” and accelerate locus-specific mechanistic insights. 

In parallel, there has been a scaling of efforts to connect trait-associ- 
ated regulatory variants to the genes and processes that they regulate 
in cell types relevant to the disease of interest"°™. For example, the 
GTEx (Genotype-Tissue Expression) consortium has mapped thou- 
sands of expression quantitative trait loci (QTLs) across hundreds of 
individuals and dozens of tissues". Further clues to the relationships 
between regulatory variants and their effector genes can be gathered 
from DNA proximity assays (suchas Hi-C) and single-cell data"? (Fig. 2). 
Programs such as HubMAP™ and the Human Cell Atlas” are set to 
deliver comprehensive, high-resolution reference maps of individual 
human cell types across diverse developmental stages, providing new 
opportunities to understand how regulatory genetic variation results 
in cellular and organismal phenotypes. 

Efforts to probe the clinical consequences of coding alleles with large 
phenotypic effects (particularly null alleles) in humans?‘ and across 


diverse animal models” represent powerful strategies for extending 


functional analyses to the whole-body level. Connections between 
genetic variation and circulating proteomic and metabolomic data 
provide additional mechanistic links between cellular events and 
whole-body physiology’””®°. These efforts are paralleled by PhewAS 
approaches®, which, by mapping variant effects across the range of 
traits available in biobanks and EMRs, can inform priors for cell types 
and pathways at individual loci. Importantly, whereas early studies 
typically linked GWAS risk alleles to data froma single functional assay, 
the focus is increasingly on maximizing biological insight through the 
multi-dimensional integration of multiple genome-wide data types 
using approaches suchas heritability partitioning’”, functional enrich- 
ment analyses™””, integration of the three-dimensional genome struc- 
ture”’® and deep convolutional neural networks”?”°, 

Although QTL analyses can implicate a haplotype in a molecular, 
cellular or organismal phenotype, they are, in isolation, insufficient 
to define the specific causal variants responsible. To address this, 
there has been rapid maturation of technologies, such as massively 
parallel reporter assays”? and CRISPR genome editing, to support 
functional characterization of targeted sequence perturbations at 
scale. Variations on these methods enable the functional evaluation 
of genes (via knockout screens™), regulatory elements (using CRISPR 
interference and CRISPR activation screens”>’), and genetic variants 
(base editors’”’) at increasing scale and resolution”. Combined with 
complex readouts—including high-content imaging” and single-cell 
transcriptomics and epigenomics”’”°—these methods can generate 
empirical ‘truth’ data, supporting the development of in silico models 
to predict causal variants, effector transcripts” and cellular effects. In 
due course, such models should reduce the need for exhaustive experi- 
mental characterization of function for all variants across all cell types. 

The goal of such efforts is to enumerate the cascade of molecular 
events that underlie observed genotype-phenotype associations using 
physiologically relevant cellular systems (from primary cells to orga- 
noids and ‘organ-on-chip’ designs) and whole-body assays appropriate 
to the disease of interest. Collectively, strategies that offer large-scale 
functional evaluation of variants and genes of interest will reduce (but 
probably not eliminate) the intensive effort required for ‘final mile’ 
validation of disease mechanisms in dedicated systems, thereby accel- 
erating downstream translational application. 


Clinical implementation 


Medical genetics, as applied to rare diseases, has been characterized 
by the rapid application in the clinic of the transformative genomic 
technologies that drove initial research discoveries. There are now 
targeted genetic tests for nearly all clinical presentations attributable 
tolarge-impact alleles, alongside more extensive genome-sequencing 
assays that, when necessary, enable interrogation of a longer list of 
relevant genes. Genetic testing for symptomatic individuals and at-risk 
relatives occurs routinely in many medical specialties. In parallel, the 
use of somatic cancer testing has increased as therapies targeted to 
specific mutational events have entered clinical practice (these devel- 
opments are reviewed elsewhere”). 

For patients with symptoms that indicate a probable monogenic aeti- 
ology (suchas retinal degeneration, hearing loss or cardiomyopathy), 
targeted panels are typically the platform of choice’, although they 
are increasingly performed ona more extensive sequence backbone. 
For more complex phenotypes—those without a clear match to a spe- 
cific syndrome, such as neurodevelopmental disorders and multiple 
congenital anomalies—testing has gravitated towards early deploy- 
ment of exome and genome-sequencing platforms that offer speedy 
resolution of what has historically often been a traumatic diagnostic 
odyssey’*. The power of genomic diagnosis is especially clear for 
those presenting with monogenic neurodevelopmental disorders and 
critically ill infants’>3°, Sequencing of the parent-offspring trio can 
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Fig. 2| Genetic discovery is paralleled by advances in functional genomics 
technologies. Top, the growth inthe number of genetic loci associated by 
GWAS with human traits and diseases (bars) and of variant-to-function studies 
(area under line, not to scale). Bottom, foundational technological and 
computational advances over the last decade that enabled (1) development 

of systematic, genome-wide catalogues of functional elements across 
multiple cell types and tissues (blue); (2) mapping of QTLsin the context 

of gene expression, metabolites, proteins and regulatory elements (red); 

(3) engineering of genes, genetic elements and genetic variation at increasing 
scale (orange); and (4) systematic tissue-specific surveys of regulatory 


detect de novo variation in dominant disorders and phase biallelic rare 
variants in recessive disease”. 

The transition from targeted gene tests to genomic sequencing 
enables recursive reanalysis, including reinterpretation of individual 
sequences on the basis of subsequent discoveries regarding causal dis- 
ease alleles and their phenotypic consequences”. However, improved 
molecular diagnostics are required to ensure reliable detection of a 
subset of genetic disorders, including those arising from triplet repeats 
and complex rearrangements’*. Deep sequencing of affected tissues 
for mosaic variants and the use of RNA sequencing to detect noncoding 
variants that drive early-onset disease (for example, through effects 
on splicing) represent new fronts for clinical diagnostics”. 

Other examples of the rapid adoption of new genomic technologies 
include noninvasive prenatal testing (more than ten million tests by 
2018 across multiple countries!” “’) and the use of recessive carrier 
panels for couples planning pregnancies. Newborn screening is now 
universal in many countries, although it is limited to disorders combin- 
ing high-throughput low-cost detection with effective early interven- 
tions (such as diet restrictions or enzyme replacement)”. Genetic 
diagnostics are also increasingly applied to newborn screening as a 
reflex test following an abnormal (for example, metabolic) screening 
test’?. Over the next decade, the repertoire of disorders captured by 
neonatal screening and prenatal testing is likely to expand markedly. 
Whereas prenatal testing may be more effective at avoiding disease, 
the associated ethical issues are more complex™. 

Although genetic testing for rare disease and cancer has exploded, 
there has been more limited uptake of genetic information in other 
aspects of healthcare. For example, despite multiple examples of clini- 
cally important genetic markers related to drug efficacy and side-effect 
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profile’, the roll-out of pharmacogenetics has been hampered by a 
range of factors, including lack of clinical decision support in electronic 
medical systems to guide the drug choice or dosing by the physician. 
This has been compounded by challenges in diagnostic testing: com- 
plex haplotype structures and structural variants at some key drug 
metabolism loci necessitate genome sequencing or specific targeted 
panels to detect all clinically relevant variants. 

For common diseases, translational attention is currently focused 
on the clinical potential of polygenic risk scores. The development 
of robust polygenic scores for several common diseases has been 
catalysed by more precise per-variant effect estimates from larger 
GWAS datasets, improved algorithms for combining information 
across millions of single-nucleotide polymorphisms, and large-scale 
biobanks that support score validation®“*”, For example, a genome- 
wide polygenic score for heart attack, incorporating 6.6 million vari- 
ants, indicates that 5% of European-descent individuals have a risk 
of future cardiac events equivalent to that seen in those with less 
frequent monogenic forms of hypercholesterolaemia™. Increasingly, 
the shift from array-based genotyping to sequence-based analysis is 
facilitating risk prediction, which integrates information from rare, 
large-effect alleles with that from polygenic scores”. By improving the 
capture of genetic risk, particularly innon-European populations, and 
integrating environmental and biomarker data to quantify aspects of 
non-genetic risk, it should be possible to achieve increasingly accurate 
prediction of individual disease risk, and to use this information to 
tailor screening, prevention and treatment. Success will depend on 
developing models of risk that robustly integrate these diverse data 
types and on optimizing the strategies deployed to ensure effective 
implementation. 


Table 1| Examples showing progress in the clinical utility of genetic testing 


Mode of clinical utility Disorder Gene(s) Treatment or action 
Targeted therapeutics Cystic fibrosis CFTR (p.G551D) lvacaftor 
Retinal dystrophy RPE65 Voretigene neparvovec-rzyl 
Fabry disease GLA (specific variants) Migalastat 
Spinal muscular atrophy SMN1 Nusinersen 
Familial hypercholesterolaemia PCSK9 Evolocumab and alirocumab 
Altering diet or behaviour Phenylketonuria PAH Phenylalanine-free diet 
Hereditary fructose intolerance ALDOB Elimination of fructose and sucrose from diet 
Biotinidase deficiency BTID Biotin vitamin supplements 
Usher syndrome MYO7A and others Avoid cycling and other activities that depend 
on balance and peripheral vision 
Prophylactic interventions Sickle-cell disease HBB*® Antibiotics and pneumococcal vaccine 
Brugada syndrome SCND5A Implantable cardioverter-defibrillator 
Hereditary breast or ovarian cancer BRCA1 or BRCA2 Bilateral salpingo-oophorectomy 
Multiple endocrine neoplasia 2 RET Thyroidectomy 
Increased surveillance Lynch syndrome MSH2 and others Early initiation of colonoscopy 
Marfan syndrome FBN1 Echocardiography 
Haemochromatosis HFE Iron screening 
Alpha-1 antitrypsin deficiency AAT Spirometry, liver function tests 


There is an increasing number of examples in which a specific molecular diagnosis results in downstream actions to improve patient outcomes. These actions can range from a change in diet 
or behaviour to prophylactic surgery. For each example shown, we provide the disorder, the gene and corresponding actions. The GeneReviews compendium (https://www.ncbi.nim.nih.gov/ 
books/NBK1116/) provides up-to-date information on the management of these and other Mendelian disorders. 


The absence of evidence-based guidelines to support healthcare 
recommendations continues to hinder the clinical applications of 
genetic data. In some countries, this is compounded by confusion 
over reimbursement and disparities in testing across society®. Many 
healthcare professionals lack experience in genomic medicine and 
need education and guidance to practice in the rapidly evolving 
space of genetic and genomic testing’. One consequence of these 
difficulties has been an expanding direct-to-consumer testing mar- 
ket, variably controlled by country-specific regulations’, which 
is moving beyond a focus on ancestry and personal traits, towards 
models in which individuals have direct access to ordering physicians 
and genetic counselors’. The risk of commercial influence in this 
model remains high. There are concerns about the consequences 
of unfettered release of genetic data of dubious or inflated clinical 
relevance, and limited infrastructure to pull these results into main- 
stream medical systems. 

These advances have fostered debate about the value of genetics 
for population screening, for both monogenic and complex disorders. 
Population screening for monogenic disorders is most likely to be 
initiated for conditions for which risk estimates are well-understood 
and there are actionable interventions (for example, Lynch syndrome 
and familial hypercholesterolaemia). Expansion to other disorders 
requires better understanding of the penetrance of pathogenic alleles 
in unselected populations” and caution before extending screening to 
longer lists of genes that are less securely implicated in disease causa- 
tion’’. As certain countries consider universal capture of genome-wide 
genetic data at birth or later in life, key questions concern the strategies 
for releasing this information to citizens and their medical teams to 
support individual healthcare. 

Ultimately, barriers to genomic medicine are most directly over- 
come by demonstrating clinical utility in disease management and 
therapeutic decision-making, with evidence for improved patient out- 
comes. Hereditary cancers provide multiple examples, suchas the use 
of BRCA1/BRCA2 testing to inform PARP inhibitor treatment in patients 
with cancer’. There is a growing list of diseases for which a molecular 
diagnosis results in specific interventions designed toimprove patient 
outcomes (https://www.ncbi.nlm.nih.gov/books/NBK1116/) (some 


examples are listed in Table 1), and there are currently more than 50 
FDA-approved drugs for genetic disorders’. Although gene therapy 
has been slow to evolve since its early introduction, recent advances 
in gene editing are reinvigorating approaches to treat disorders by 
manipulation of the underlying genetic defects’. 


Looking forward 


Over the coming decade, the challenge will be to optimize and to imple- 
mentat scale, strategies that use human genetics to further the under- 
standing of health and disease, and to maximize the clinical benefit 
of those discoveries. Realizing these goals will require the concerted 
effort of researchers in academia and industry to bring about transfor- 
mational change across a range of highly interconnected domains, for 
example, through the auspices of the recently established International 
Common Disease Alliance (https://www.icda.bio). Such efforts will 
be directed towards establishing: (a) comprehensive inventories of 
genotype-phenotype relationships across populations and environ- 
ments; (b) systematic assays of variant- and gene-level function across 
cell types, states and exposures; (c) improved scalable strategies for 
turning this basic knowledge into fully developed molecular, cellular 
and physiological models of disease pathogenesis; and (d) application 
of those biological insights to drive novel preventative and therapeutic 
options. 

The first of these will involve documenting the full spectrum of natu- 
ral genetic variation across all human populations, including capture 
of structural variants, and somatic mutations that accumulate with 
aging!”® and associating these variations with the ever-richer disease- 
related intermediate and clinical traits available through biobanks and 
electronic health records. It will be particularly important to include 
populations historically under-represented in genomic research, fol- 
lowing the pioneering work of the H3Africa consortium’. As over 
time, clinically sequenced genomes will outnumber those collected in 
academia, research and healthcare communities will need to developa 
harmonized approach to genomics to transcend historical boundaries. 
Progress will be critically dependent on platforms and governance that 
lower barriers to the integration of genetic and phenotypic data across 
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Box 1 


Global genomics 


Present and future advances in genetics and genomics have the 
potential to provide benefits to individuals and societies across the 
world, but equitable and fair access to those benefits will require 
proactive measures to address entrenched disparities in scientific 
capacity and clinical opportunities. This includes: 

(1) Global characterization of genetic variation. Systematic 
catalogues of human genome variation from a broad range of 
global populations will maximize the genetic diversity available for 
genetic discovery and clinical implementation. This will support 
more accurate imputation, enable more effective use of polygenic 
scores, and improve detection and interpretation of rare large- 
effect alleles. 

(2) Genetic discovery for diseases with restricted geographical 
coverage. Genetic studies conducted in diverse populations will 
support genetic discovery as well as functional characterization of 
hitherto-neglected diseases with disproportionate regional impact 
and may lead to novel preventative and therapeutic strategies. 

(3) An ethical framework for global research. Genetic research 
needs to proceed on the expectation that the benefits of research 
will be available to those who have participated, and that 
relationships between researchers and patients are based on 
robust expressions of accountability and governance. Participation 
should be based on appropriate informed individual consent in the 
context of community involvement, with mechanisms to support 
equitable data sharing with researchers in participating countries. 

(4) Support for local research capacity. Collection of biological 
specimens and data are dependent on adequate support for data 
collection, generation and analysis in participating countries, with 
infrastructure, training and career structures in place to support 
local researchers and equitable credit for shared research outputs. 

(5) Equitable translation. Participants and their communities 
should have fair and equitable access to the biomedical 
information arising from the studies in which they are involved 
to support population-relevant clinical diagnostics and 
implementation (for example, of polygenic scores). There 
should be commitment to the development of technologies and 
strategies that enable the clinical benefits of genomic medicine to 
be accessible to those in less-developed healthcare environments. 


studies and countries, along with technical standards that are reliable, 
secure and compatible with the international regulatory landscape’. 

Mechanistic interpretation of genetic associations, particularly 
those in regulatory regions, will be driven by the systematic anno- 
tation of sequence variants and genes for functional impact across 
disease-relevant cell types, enabling mapping of processes contrib- 
uting to disease development with respect to place (tissue and cell 
type), time (developmental stage) and context (external influences)’. 
Accelerating efforts to characterize the cellular composition of tis- 
sues through single-cell assays’ will increase the granularity of these 
observations. Large-scale perturbation studies across diverse cellular 
and animal models will, together with analyses of coding variants in 
humans**™, provide confidence in causal inference. Large-scale pro- 
teomic and metabolomic analyses (in tissues and biological fluids) 
will provide a bridge to downstream pathways’”®°. Research access 
to such functional data, generated at scale, should lower the barriers 
to mechanistic inference, provide system-wide context and enable 
researchers to focus wet-laboratory validation on the most critical 
experiments. Collectively, these efforts will support compilation of 
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a systematic catalogue of key networks and processes that influence 
normal physiology and disease development and inform a revised 
molecular taxonomy of disease. 

This knowledge will reinforce the essential contribution of human 
genetics to the identification and prioritization of targets for thera- 
peutic development®”’”. Insights into the efficacy of target pertur- 
bation and potential for adverse events, allied to characterization of 
translatable biomarkers, provide ways to boost the efficiency of drug- 
development pipelines!”. Given the clinical importance of slowing 
disease progression’, target-discovery efforts will increasingly need to 
embrace the genetics of disease progression and treatment response, 
as these may involve processes distinct from those captured by studies 
of disease onset. 

In parallel, the clinical use of human genetics will benefit from 
progress towards universal determination of individual genome 
sequences built through a combination of biobank expansion and 
direct access within healthcare systems. This will power clinical appli- 
cations that extend beyond the current focus on neonatal sequenc- 
ing, Mendelian diagnostics and somatic tumour sequencing. In 
particular, improvements in polygenic score derivation will boost 
risk prediction for multifactorial traits, provide a molecular basis for 
disease classification, support biomarker discovery and therapeutic 
optimization and contribute to understanding of the variable pen- 
etrance of monogenic conditions”. Implementing genomic medicine 
as aroutine component of clinical care across diverse healthcare envi- 
ronments will inevitably require investment in the training of health- 
care professionals and attention to optimal strategies for returning 
genetic findings to patients. 

The limited heritability of many multifactorial traits constrains the 
clinical precision available from genetic data alone. This will drive 
efforts to integrate information on personal environment, lifestyle 
and behaviour, and to combine prognostic, predictive information 
on disease risk with longitudinal measures of molecular and clinical 
state that track an individual’s journey from health to disease. Human 
genetics will also, given its unique potential for causal inference, sup- 
port identification of the non-genetic risk factors (often modifiable) 
that directly contribute to disease predisposition and development'®. 
As polygenic score performance improves, analysis of individuals who 
show marked divergence between genetic predisposition and real- 
world clinical outcomes should define exposures (such as lifestyle 
choices or gut microbiome) the contribution of which to disease cau- 
sation remains unclear’. 

Collectively, these developments can be expected to accelerate per- 
sonalization of healthcare delivery. Provided costs are sustainable, 
amore preventative perspective on health could emerge, managed 
through proactive genomic, clinical and lifestyle surveillance using risk 
scores, complex biomarkers, liquid biopsies and wearables. Improved 
understanding of aetiological heterogeneity, patterns of sharing of 
genetic risk across diseases, variation in therapeutic response and risk 
of adverse events will enhance targeting of preventative and therapeu- 
tic interventions”. At the population level, intervention strategies 
will seek to combine population-wide and targeted strategies to best 
effect’. It will be critical to ensure that these benefits are available to 
as many as possible, so that genomics reduces, rather than exacerbates, 
national and global health disparities>’” (Box 1). 

The developments described above, represent variations on the 
theme of ‘reading’ the genome. The emerging capacity to block this 
reading (for example, through siRNA therapies”) or even to ‘write’ the 
genome (through CRISPR editing) promises to be equally transforma- 
tive, providing new opportunities to correct, and even cure, Mendelian 
disease. Spectacular advances in developing novel therapeutic strate- 
gies are likely for many diseases, based, for example, on ex vivo cellular 
manipulation” or in vivo somatic cell editing”. 

Importantly, developments in genomic medicine need to proceed 
ina bioethical framework for research and clinical use that recognizes 


the personal relevance of human genetics and the critical importance 
of autonomous consent and the protection of privacy, while minimiz- 
ing the adverse consequences of genetic exceptionalism. Governance 
needs to reaffirm the rights of citizens to make individual contributions 
to scientific progress through research participation and encourage 
the responsible exchange of data for clinical and research purposes. 


Future prospects 


Over the past two decades, understanding of the genetic basis of human 
disease has been transformed by a combination of spectacular tech- 
nological and analytical advances, collaborative commitment to the 
development of foundational resources and the collection and analysis 
of vast amounts of genetic, molecular and clinical data. The biological 
insights derived from these data are, increasingly, drivers of transla- 
tional innovation, and widening personal access to large-scale genetic 
and molecular data promises to reshape medical care. 

However, for the full potential of genomic medicine to be realized, 
there will need to be sustained collaborative endeavour on several 
fronts to ensure that the capacity to generate ever more detailed maps 
of the relationships between sequence variation and biomedical pheno- 
types delivers acomprehensive understanding of disease mechanisms 
that can be translated into the medicines of tomorrow. 


1. International Human Genome Sequencing Consortium. Initial sequencing and analysis of 
the human genome. Nature 409, 860-921 (2001). 

This paper describes the first analyses from the draft human genome sequence 
assembled over the previous decade: it launched modern human genetics and 
represents a tribute to the power of collaborative science. 

2. — International HapMap Consortium. The International HapMap Project. Nature 426, 
789-796 (2003). 

The HapMap Consortium developed the first genome-wide maps of common sequence 
variation, using this information to lay out the haplotypic structure of this variation 
across three major ancestral groupings (from Europe, East Asia and Africa). 

3. 1000 Genomes Project Consortium. A global reference for human genetic variation. 
Nature 526, 68-74 (2015). 

4. The Haplotype Reference Consortium. A reference panel of 64,976 haplotypes for 
genotype imputation. Nat. Genet. 48, 1279-1283 (2016). 

5. Lek, M. et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature 536, 
285-291 (2016). 

6. Karezewski, K. J. et al. Variation across 141,456 human exomes and genomes reveals the 
spectrum of loss-of-function intolerance across human protein-coding genes. Preprint at 
bioRxiv https://doi.org/10.1101/531210 (2019). 

The most recent report from the genome aggregation database (gnomAD) project 
combining human sequence data on an unprecedented scale to characterize rare, 
high-impact variants and their relationship to health and disease. 

7. Kremer, B. et al. A worldwide study of the Huntington's disease mutation. The sensitivity 
and specificity of measuring CAG repeats. N. Engl. J. Med. 330, 1401-1406 (1994). 

8. Collins, F. S. Identifying human disease genes by positional cloning. Harvey Lect. 86, 
149-164 (1990-1991). 

9. Gusella, J. F. & MacDonald, M. E. Huntington’s disease and repeating trinucleotides. 

N. Engl. J. Med. 330, 1450-1451 (1994). 

0. Vissers, L. E. et al. Array-based comparative genomic hybridization for the genomewide 
detection of submicroscopic chromosomal abnormalities. Am. J. Hum. Genet. 73, 
1261-1270 (2003). 

1. Ng, S.B. et al. Targeted capture and massively parallel sequencing of 12 human exomes. 
Nature 461, 272-276 (2009). 

2. Wheeler, D. A. et al. The complete genome of an individual by massively parallel DNA 
sequencing. Nature 452, 872-876 (2008). 

3. Vissers, L. E. et al. Ade novo paradigm for mental retardation. Nat. Genet. 42, 1109-1112 
(2010). 

4. Deciphering Developmental Disorders Study. Prevalence and architecture of de novo 
mutations in developmental disorders. Nature 542, 433-438 (2017). 

5. Splinter, K. et al. Effect of genetic diagnosis on patients with previously undiagnosed 
disease. N. Engl. J. Med. 379, 2131-2139 (2018). 

6. Deciphering Developmental Disorders Study. Large-scale discovery of novel genetic 
causes of developmental disorders. Nature 519, 223-228 (2015). 

7. Firth, H. V. et al. DECIPHER: database of chromosomal imbalance and phenotype in 
humans using ensembl resources. Am. J. Hum. Genet. 84, 524-533 (2009). 

8. Philippakis, A.A. et al. The Matchmaker Exchange: a platform for rare disease gene 
discovery. Hum. Mutat. 36, 915-921 (2015). 

9. Boycott, K. M. et al. International cooperation to enable the diagnosis of all rare genetic 
diseases. Am. J. Hum. Genet. 100, 695-705 (2017). 

20. Kennedy, J. et al. KAT6A syndrome: genotype-phenotype correlation in 76 patients with 

pathogenic KAT6A variants. Genet. Med. 21, 850-860 (2019). 

21. Amberger, J. S., Bocchini, C. A., Schiettecatte, F., Scott, A. F. & Hamosh, A. OMIM.org: 
Online Mendelian Inheritance in Man (OMIM), an online catalog of human genes and 
genetic disorders. Nucleic Acids Res. 43, D789-D798 (2015). 


22. 


23. 


24. 


25. 


26. 


27. 


28. 


29. 


30. 


31. 


32. 


33. 


34. 


35. 


36. 


37. 


38. 


39. 


40. 


54. 


Landrum, M. J. et al. ClinVar: public archive of relationships among sequence variation 
and human phenotype. Nucleic Acids Res. 42, D980-D985 (2014). 

Rehm, H. L. et al. ClinGen—the Clinical Genome Resource. N. Engl. J. Med. 372, 
2235-2242 (2015). 

Minikel, E. V. et al. Quantifying prion disease penetrance using large population control 
cohorts. Sci. Transl. Med. 8, 322raQ (2016). 

Flannick, J. et al. Assessing the phenotypic effects in the general population of rare 
variants in genes for a dominant Mendelian form of diabetes. Nat. Genet. 45, 1380-1385 
(2013). 

BeGanovic, K. et al. ASNP in the HTT promoter alters NF-KB binding and is a bidirectional 
genetic modifier of Huntington disease. Nat. Neurosci. 18, 807-816 (2015). 

Castel, S. E. et al. Modified penetrance of coding variants by cis-regulatory variation 
contributes to disease risk. Nat. Genet. 50, 1327-1334 (2018). 

MacArthur, D. G. et al. Guidelines for investigating causality of sequence variants in 
human disease. Nature 508, 469-476 (2014). 

Findlay, G. M. et al. Accurate classification of BRCA1 variants with saturation genome 
editing. Nature 562, 217-222 (2018). 

This paper describes an effort to obtain massively parallel functional assessments of 
all potential sequence variants in a gene causal for familial breast cancer, and thereby 
provide more confident predictions of clinical significance for those found to carry a 
previously unseen mutation. 

Cummings, B. B. et al. Improving genetic diagnosis in Mendelian disease with 
transcriptome sequencing. Sci. Transl. Med. 9, eaal5209 (2017). 

Butcher, D. T. et al. CHARGE and Kabuki syndromes: gene-specific DNA methylation 
signatures identify epigenetic mechanisms linking these clinically overlapping 
conditions. Am. J. Hum. Genet. 100, 773-788 (2017). 

Short, P. J. et al. De novo mutations in regulatory elements in neurodevelopmental 
disorders. Nature 555, 611-616 (2018). 

Jaganathan, K. et al. Predicting splicing from primary sequence with deep learning. Cell 
176, 535-548 (2019). 

Hugot, J. P. et al. Association of NOD2 leucine-rich repeat variants with susceptibility to 
Crohn's disease. Nature 411, 599-603 (2001). 

Ogura, Y. et al. A frameshift mutation in NOD2 associated with susceptibility to Crohn's 
disease. Nature 411, 603-606 (2001). 

Barbosa, J., Chern, M. M., Noreen, H. & Anderson, V. E. Analysis of linkage between the 
major histocompatibility system and juvenile, insulin-dependent diabetes in multiplex 
families. Reanalysis of data. J. Clin. Invest. 62, 492-495 (1978). 

Risch, N. & Merikangas, K. The future of genetic studies of complex human diseases. 
Science 273, 1516-1517 (1996). 

A highly influential commentary that switched much of the complex trait genetics field 
from linkage to association and imagined GWAS several years before the technology 
made them a reality. 

Klein, R. J. et al. Complement factor H polymorphism in age-related macular 
degeneration. Science 308, 385-389 (2005). 

Arguably the first full GWAS, this study demonstrated the potential of agnostic 
genomic surveys to highlight entirely novel biology—in this case, a role for 
complement in the pathogenesis of macular degeneration. 

Duerr, R. H. et al. A genome-wide association study identifies /L23R as an inflammatory 
bowel disease gene. Science 314, 1461-1463 (2006). 

Wellcome Trust Case Control Consortium. Genome-wide association study of 14,000 
cases of seven common diseases and 3,000 shared controls. Nature 447, 661-678 
(2007). 

This paper, which describes what was, at the time, the largest GWAS yet conducted, 
demonstrated the broad applicability of the approach, and set the scene for the 
decade of GWAS discovery that followed. 

Mahajan, A. et al. Fine-mapping type 2 diabetes loci to single-variant resolution using 
high-density imputation and islet-specific epigenome maps. Nat. Genet. 50, 1505-1513 
(2018). 

Astle, W. J. et al. The allelic landscape of human blood cell trait variation and links to 
common complex disease. Cell 167, 1415-1429 (2016). 

Lango Allen, H. et al. Hundreds of variants clustered in genomic loci and biological 
pathways affect human height. Nature 467, 832-838 (2010). 

Buniello, A. et al. The NHGRI-EBI GWAS catalog of published genome-wide association 
studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 47, D1005-D1012 
(2019). 

Fuchsberger, C. et al. The genetic architecture of type 2 diabetes. Nature 536, 41-47 
(2016). 

UK10K Consortium. The UK10K project identifies rare variants in health and disease. 
Nature 526, 82-90 (2015). 

Manolio, T. A. et al. Finding the missing heritability of complex diseases. Nature 461, 
747-753 (2009). 

Wainschtein, P. et al. Recovery of trait heritability from whole genome sequence data. 
Preprint at bioRxiv https://doi.org/10.1101/588020 (2019). 

Flannick, J. et al. Loss-of-function mutations in SLC30A8 protect against type 2 diabetes. 
Nat. Genet. 46, 357-363 (2014). 

Moltke, |. et al. A common Greenlandic TBC1D4 variant confers muscle insulin resistance 
and type 2 diabetes. Nature 512, 190-193 (2014). 

Nioi, P. et al. Variant ASGR1 associated with a reduced risk of coronary artery disease. 

N. Engl. J. Med. 374, 2131-2141 (2016). 

Pollin, T. |. et al. A null mutation in human APOC3 confers a favorable plasma lipid profile 
and apparent cardioprotection. Science 322, 1702-1705 (2008). 

Narasimhan, V. M. et al. Health and population effects of rare gene knockouts in adult 
humans with related parents. Science 352, 474-477 (2016). 

A powerful illustration of the insights that can be gained from the study of individuals 
who are homozygous carriers of null alleles in genes of biomedical interest. 
Saleheen, D. et al. Human knockouts and phenotypic analysis in a cohort with a high rate 
of consanguinity. Nature 544, 235-239 (2017). 


Nature | Vol577 | 9 January 2020 | 187 


Review 


55. 


56. 


57. 


58. 


59. 


60. 


61. 


62. 


63. 


64. 


65. 


66. 


67. 


68. 


69. 


70. 


71. 


72. 


73. 


74. 


75. 


76. 


77. 


78. 


79. 


80. 


81. 


82. 


83. 


84. 


85. 


86. 


87. 


88. 


89. 


90. 


91. 


Martin, A. R. et al. Clinical use of current polygenic risk scores may exacerbate health 
disparities. Nat. Genet. 51, 584-591 (2019). 

Anelegant distillation of the ways in which the historical focus of genetic discovery 
among populations of European descent has adverse consequences for both discovery 
and translation. 

Zeng, J. et al. Signatures of negative selection in the genetic architecture of human 
complex traits. Nat. Genet. 50, 746-753 (2018). 

Genovese, G. et al. Association of trypanolytic ApoL1 variants with kidney disease in 
African Americans. Science 329, 841-845 (2010). 

Luzzatto, L., Usanga, F. A. & Reddy, S. Glucose-6-phosphate dehydrogenase deficient red 
cells: resistance to infection by malarial parasites. Science 164, 839-842 (1969). 

Wojcik, G. L. et al. Genetic analyses of diverse populations improves discovery for 
complex traits. Nature 570, 514-518 (2019). 

Farh, K. K. et al. Genetic and epigenetic fine mapping of causal autoimmune disease 
variants. Nature 518, 337-343 (2015). 

An early demonstration of the value of overlapping genome-wide association data with 
tissue-specific regulatory maps to define the cell-types and tissues likely to be driving 
disease pathology. 

Freund, M. K. et al. Phenotype-specific enrichment of Mendelian disorder genes near 
GWAS regions across 62 complex traits. Am. J. Hum. Genet. 103, 535-552 (2018). 

Boyle, E. A., Li, Y. |. & Pritchard, J. K. An expanded view of complex traits: from polygenic 
to omnigenic. Cell 169, 1177-1186 (2017). 

de Lange, K. M. et al. Genome-wide association study implicates immune activation of 
multiple integrin genes in inflammatory bowel disease. Nat. Genet. 49, 256-261 (2017). 
Sekar, A. et al. Schizophrenia risk from complex variation of complement component 4. 
Nature 530, 177-183 (2016). 

Jostins, L. et al. Host-microbe interactions have shaped the genetic architecture of 
inflammatory bowel disease. Nature 491, 119-124 (2012). 

Richardson, T. G., Harrison, S., Hemani, G. & Davey Smith, G. An atlas of polygenic risk 
score associations to highlight putative causal relationships across the human phenome. 
eLife 8, €43657 (2019). 

Dendrou, C. A. et al. Resolving TYK2 locus genotype-to-phenotype differences in 
autoimmunity. Sci. Transl. Med. 8, 363ra149 (2016). 

Voight, B. F. et al. Plasma HDL cholesterol and risk of myocardial infarction: a Mendelian 
randomisation study. Lancet 380, 572-580 (2012). 

Khera, A. V. et al. Genome-wide polygenic scores for common diseases identify individuals 
with risk equivalent to monogenic mutations. Nat. Genet. 50, 1219-1224 (2018). 

This paper used large-scale genetic data to demonstrate the clinical potential of the 
polygenic scores that can be constructed for many common diseases, emphasizing 
that, in some situations, the lifetime risk of disease for those with the highest scores 
approaches that of established monogenic disease. 

Mahmood, S. S., Levy, D., Vasan, R. S. & Wang, T. J. The Framingham Heart Study and the 
epidemiology of cardiovascular disease: a historical perspective. Lancet 383, 999-1008 
(2014). 

Gudbjartsson, D. F. et al. Variants conferring risk of atrial fibrillation on chromosome 
4q25. Nature 448, 353-357 (2007). 

Thorgeirsson, T. E. et al. A variant associated with nicotine dependence, lung cancer and 
peripheral arterial disease. Nature 452, 638-642 (2008). 

Collins, F. S. & Varmus, H. A new initiative on precision medicine. N. Engl. J. Med. 372, 
793-795 (2015). 

Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. 
Nature 562, 203-209 (2018). 

Nagai, A. et al. Overview of the BioBank Japan Project: study design and profile. 

J. Epidemiol. 27, S2-S8 (2017). 

Chen, Z. et al. China Kadoorie Biobank of 0.5 million people: survey methods, baseline 
characteristics and long-term follow-up. Int. J. Epidemiol. 40, 1652-1666 (2011). 

Kohane, I. S. Using electronic health records to drive discovery in disease genomics. 
Nat. Rev. Genet. 12, 417-428 (2011). 

Abul-Husn, N. S. & Kenny, E. E. Personalized medicine and the power of electronic health 
records. Cell 177, 58-69 (2019). 

Sun, B. B. et al. Genomic atlas of the human plasma proteome. Nature 558, 73-79 (2018). 
Shin, S. Y. et al. An atlas of genetic influences on human blood metabolites. Nat. Genet. 
46, 543-550 (2014). 

Sung, Y. J. et al. A multi-ancestry genome-wide study incorporating gene-smoking 
interactions identifies multiple new loci for pulse pressure and mean arterial pressure. 
Hum. Mol. Genet. 28, ddz070 (2019). 

Pendergrass, S. A. & Crawford, D. C. Using electronic health records to generate 
phenotypes for research. Curr. Protoc. Hum. Genet. 100, e80 (2019). 

Denny, J. C. et al. PheWAS: demonstrating the feasibility of a phenome-wide scan to 
discover gene-disease associations. Bioinformatics 26, 1205-1210 (2010). 

Liu, Z. et al. Mendelian randomization analysis dissects the relationship between NAFLD, 
T2D and obesity and provides implications for precision medicine. Preprint at bioRxiv 
https://doi.org/10.1101/657734 (2019). 

Cotsapas, C. et al. Pervasive sharing of genetic effects in autoimmune disease. PLoS 
Genet. 7, €1002254 (2011). 

Kanai, M. et al. Genetic analysis of quantitative traits in the Japanese population links cell 
types to complex human diseases. Nat. Genet. 50, 390-400 (2018). 

Gudmundsson, J. et al. Two variants on chromosome 17 confer prostate cancer risk, and 
the one in TCF2 protects against type 2 diabetes. Nat. Genet. 39, 977-983 (2007). 
Abul-Husn, N. S. et al. A protein-truncating HSD17B73 variant and protection from chronic 
liver disease. N. Engl. J. Med. 378, 1096-1106 (2018). 

Plenge, R. M., Scolnick, E. M. & Altshuler, D. Validating therapeutic targets through human 
genetics. Nat. Rev. Drug Discov. 12, 581-594 (2013). 

Nielsen, J. B. et al. Biobank-driven genomic discovery yields new insight into atrial 
fibrillation biology. Nat. Genet. 50, 1234-1239 (2018). 

Flannick, J. et al. Exome sequencing of 20,791 cases of type 2 diabetes and 

24,440 controls. Nature 570, 71-76 (2019). 


188 | Nature | Vol577 | 9 January 2020 


92. 


93. 


94. 


95. 


96. 


97. 


98. 


99. 


100. 


101. 


07. 


109. 


110. 


111. 


112. 


113. 


114. 


115. 
116. 


117. 


118. 


119. 


120. 


121. 


122. 


123. 


124. 


125. 


02. 


03. 


04. 


05. 


06. 


08. 


Van Hout, C. V. et al. Whole exome sequencing and characterization of coding variation in 
49,960 individuals in the UK Biobank. Preprint at bioRxiv https://doi.org/10.1101/572347 
(2019). 
Khera, A. V. et al. Whole-genome sequencing to characterize monogenic and polygenic 
contributions in patients hospitalized with early-onset myocardial infarction. Circulation 
139, 1593-1602 (2019). 
Grarup, N. et al. Loss-of-function variants in ADCY3 increase risk of obesity and type 2 
diabetes. Nat. Genet. 50, 172-174 (2018). 
Cohen, J. C., Boerwinkle, E., Mosley, T. H. Jr & Hobbs, H. H. Sequence variations in PCSK9, 
low LDL, and protection against coronary heart disease. N. Engl. J. Med. 354, 1264-1272 
(2006). 
Rivas, M. A. et al. Deep resequencing of GWAS loci identifies independent rare variants 
associated with inflammatory bowel disease. Nat. Genet. 43, 1066-1073 (2011). 
Chong, J. X. et al. The genetic basis of Mendelian phenotypes: discoveries, challenges, 
and opportunities. Am. J. Hum. Genet. 97, 199-215 (2015). 
Goldstein, J. L. & Brown, M. S. A century of cholesterol and coronaries: from plaques to 
genes to statins. Cell 161, 161-172 (2015). 
Peltonen, L., Perola, M., Naukkarinen, J. & Palotie, A. Lessons from studying monogenic 
disease for common disease. Hum. Mol. Genet. 15, R67-R74 (2006). 
Bulik-Sullivan, B. et al. An atlas of genetic correlations across human diseases and traits. 
Nat. Genet. 47, 1236-1241 (2015). 
Musunuru, K. et al. From noncoding variant to phenotype via SORT1 at the 1p13 
cholesterol locus. Nature 466, 714-719 (2010). 
Widely accepted as the first paper to describe the detailed mechanistic dissection of a 
complex trait locus and still one of relatively few examples of success in this 
endeavour. 
Claussnitzer, M. et al. FTO obesity variant circuitry and adipocyte browning in humans. 
N. Engl. J. Med. 373, 895-907 (2015). 
Smemo, S. et al. Obesity-associated variants within FTO form long-range functional 
connections with IRX3. Nature 507, 371-375 (2014). 
Gupta, R. M. et al. A genetic variant associated with five vascular diseases is a distal 
regulator of endothelin-1 gene expression. Cell 170, 522-533 (2017). 
Small, K. S. et al. Regulatory variants at KLF14 influence type 2 diabetes risk via a female- 
specific effect on adipocyte size and body composition. Nat. Genet. 50, 572-580 
(2018). 
Lin, Y. T. et al. APOE4 causes widespread molecular and cellular alterations associated 
with Alzheimer’s disease phenotypes in human iPSC-derived brain cell types. Neuron 98, 
1294 (2018). 
ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human 
genome. Nature 489, 57-74 (2012). 
Roadmap Epigenomics Consortium et al. Integrative analysis of 111 reference human 
epigenomes. Nature 518, 317-330 (2015). 
The flagship paper from the NIH Roadmap Epigenomics Consortium, describing a 
detailed analysis of the epigenomic profiles of more than 100 human cell types and 
characterizing how these regulatory patterns relate to gene regulation, cellular 
differentiation and human disease. 
Tansey, K. E., Cameron, D. & Hill, M. J. Genetic risk for Alzheimer’s disease is concentrated 
in specific macrophage and microglial transcriptional networks. Genome Med. 10, 14 
(2018). 
Gamazon, E. R. et al. Using an atlas of gene regulation across 44 human tissues to inform 
complex disease- and trait-associated variation. Nat. Genet. 50, 956-967 (2018). 
Ongen, H. et al. Estimating the causal tissues for complex traits and diseases. Nat. Genet. 
49, 1676-1683 (2017). 
GTEx Consortium. Genetic effects on gene expression across human tissues. Nature 550, 
204-213 (2017). 
One of several papers describing the results obtained from the Genotype-Tissue 
Expression (GTEx) project, which used RNA-sequence data from more than 40 human 
tissues in several hundred individuals to explore the relationship between DNA 
sequence variation and tissue-specific expression. 
Chiou, J. et al. Single cell chromatin accessibility reveals pancreatic islet cell type- and 
state-specific regulatory programs of diabetes risk. Preprint at bioRxiv https://doi.org/ 
10.1101/693671 (2019). 
HuBMAP Consortium. The human body at cellular resolution: the NIH Human 
Biomolecular Atlas Program. Nature 574, 187-192 (2019). 
Regev, A. et al. The Human Cell Atlas. eLife 6, e27041 (2017). 
Lloyd, K. C. A knockout mouse resource for the biomedical research community. Ann. NY 
Acad. Sci. 1245, 24-26 (2011). 
Finucane, H. K. et al. Heritability enrichment of specifically expressed genes identifies 
disease-relevant tissues and cell types. Nat. Genet. 50, 621-629 (2018). 
Delaneau, O. et al. Chromatin three-dimensional interactions mediate genetic effects on 
gene expression. Science 364, eaat8266 (2019). 
Zhou, J. & Troyanskaya, O. G. Predicting effects of noncoding variants with deep learning- 
based sequence model. Nat. Methods 12, 931-934 (2015). 

elley, D. R., Snoek, J. & Rinn, J. L. Basset: learning the regulatory code of the accessible 
genome with deep convolutional neural networks. Genome Res. 26, 990-999 (2016). 
Patwardhan, R. P. et al. High-resolution analysis of DNA regulatory elements by synthetic 
saturation mutagenesis. Nat. Biotechnol. 27, 1173-1175 (2009). 
Melnikov, A. et al. Systematic dissection and optimization of inducible enhancers in 
human cells using a massively parallel reporter assay. Nat. Biotechnol. 30, 271-277 
(2012). 
Wang, X. et al. High-resolution genome-wide functional dissection of transcriptional 
regulatory regions and nucleotides in human. Nat. Commun. 9, 5380 (2018). 
Potting, C. et al. Genome-wide CRISPR screen for PARKIN regulators reveals 
transcriptional repression as a determinant of mitophagy. Proc. Natl Acad. Sci. USA 115, 
E180-E189 (2018). 
Gasperini, M. et al. A genome-wide framework for mapping gene regulation via cellular 
genetic screens. Cell 176, 377-390 (2019). 


27. 


31. 


37. 


141. 


142. 


143. 


144. 


145. 


146. 


147. 


148. 


149. 


150. 


151. 


152. 


153. 


154. 


155. 


156. 


157. 


158. 


159. 


160. 


161. 


26. 


28. 
29. 


30. 


32. 


33. 


34. 


35. 


36. 


38. 


39. 


40. 


Fulco, C. P. et al. Activity-by-contact model of enhancer-promoter regulation from 
thousands of CRISPR perturbations. Nat. Genet. 51, 1664-1669 (2019). 

Komor, A. C., Kim, Y. B., Packer, M. S., Zuris, J. A. & Liu, D. R. Programmable editing of a 
target base in genomic DNA without double-stranded DNA cleavage. Nature 533, 
420-424 (2016). 

Feldman, D. et al. Optical pooled screens in human cells. Cell 179, 787-799 (2019). 
Dixit, A. et al. Perturb-Seq;: dissecting molecular circuits with scalable single-cell RNA 
profiling of pooled genetic screens. Cell 167, 1853-1866 (2016). 

Datlinger, P. et al. Pooled CRISPR screening with single-cell transcriptome readout. 
Nat. Methods 14, 297-301 (2017). 
Ding, L. et al. Perspective on oncogenic processes at the end of the beginning of cancer 
genomics. Cell 173, 305-320 (2018). 
Chae, Y. K. et al. Path toward precision oncology: review of targeted therapy studies and 
tools to aid in defining “actionability” of a molecular lesion and patient management 
support. Mol. Cancer Ther. 16, 2645-2655 (2017). 
Shearer, A. E. & Smith, R. J. Massively parallel sequencing for genetic diagnosis of hearing 
loss: the new standard of care. Otolaryngol. Head Neck Surg. 153, 175-182 (2015). 

Yang, Y. et al. Molecular findings among patients referred for clinical whole-exome 
sequencing. J. Am. Med. Assoc. 312, 1870-1879 (2014). 

Friedman, J. M. et al. Genome-wide sequencing in acutely ill infants: genomic medicine’s 
critical application? Genet. Med. 21, 498-504 (2019). 

Clark, M. M. et al. Diagnosis of genetic diseases in seriously ill children by rapid whole- 
genome sequencing and automated phenotyping and interpretation. Sci. Transl. Med. 11, 
eaat6177 (2019). 

Eldomery, M. K. et al. Lessons learned from additional research analyses of unsolved 
clinical exome cases. Genome Med. 9, 26 (2017). 

Boycott, K. M. et al. A diagnosis for all rare genetic diseases: the horizon and the next 
frontiers. Cell 177, 32-37 (2019). 

Liu, S. et al. Genomic analyses from non-invasive prenatal testing reveal genetic 
associations, patterns of viral infections, and Chinese population history. Cell 175, 
347-359 (2018). 

Thung, D. T., Beulen, L., Hehir-Kwa, J. & Faas, B. H. Implementation of whole genome 
massively parallel sequencing for noninvasive prenatal testing in laboratories. Expert Rev. 
Mol. Diagn. 15, 111-124 (2015). 

Lo, J. O., Cori D, F., Norton, M. E. & Caughey, A. B. Noninvasive prenatal testing. Obstet. 
Gynecol. Surv. 69, 89-99 (2014). 

Watson, M. S. et al. Newborn screening: toward a uniform screening panel and system— 
executive summary. Pediatrics 117, S296-S307 (2006). 

Currier, R. J. et al. Genomic sequencing in cystic fibrosis newborn screening: what works 
best, two-tier predefined CFTR mutation panels or second-tier CFTR panel followed by 
third-tier sequencing? Genet. Med. 19, 1159-1163 (2017). 

Bauer, P. E. “Tell them it’s not so bad”: prenatal screening for Down syndrome and the bias 
toward abortion. Intellect. Dev. Disabil. 46, 247-251 (2008). 

Volpi, S. et al. Research directions in the clinical implementation of pharmacogenomics: 
an overview of US Programs and Projects. Clin. Pharmacol. Ther. 103, 778-786 (2018). 
International Schizophrenia Consortium. Common polygenic variation contributes to risk 
of schizophrenia and bipolar disorder. Nature 460, 748-752 (2009). 

Evans, D. M., Visscher, P. M. & Wray, N. R. Harnessing the information contained within 
genome-wide association studies to improve individual prediction of complex disease 
risk. Hum. Mol. Genet. 18, 3525-3531 (2009). 

Kurian, A. W. et al. Genetic testing and results in a population-based cohort of breast 
cancer patients and ovarian cancer patients. J. Clin. Oncol. 37, 1305-1315 (2019). 

Katz, S. J. et al. Association of attending surgeon with variation in the receipt of genetic 
testing after diagnosis of breast cancer. JAMA Surg. 153, 909-916 (2018). 

Kalokairinou, L. et al. Legislation of direct-to-consumer genetic testing in Europe: a 
fragmented regulatory landscape. J. Community Genet. 9, 117-132 (2018). 

Allyse, M. A., Robinson, D. H., Ferber, M. J. & Sharp, R. R. Direct-to-Consumer Testing 2.0: 
emerging models of direct-to-consumer genetic testing. Mayo Clin. Proc. 93, 113-120 (2018). 
Wright, C. F. et al. Assessing the pathogenicity, penetrance, and expressivity of putative 
disease-causing variants in a population setting. Am. J. Hum. Genet. 104, 275-286 (2019). 
ACMG Board of Directors. The use of ACMG secondary findings recommendations for 
general population screening: a policy statement of the American College of Medical 
Genetics and Genomics (ACMG). Genet. Med. 21, 1467-1468 (2019). 

George, A., Kaye, S. & Banerjee, S. Delivering widespread BRCA testing and PARP 
inhibition to patients with ovarian cancer. Nat. Rev. Clin. Oncol. 14, 284-296 (2017). 
CenterWatch. FDA Approved Drugs for Genetic Disease (CenterWatch); https://www. 
centerwatch.com/drug-information/fda-approved-drugs/therapeutic-area/34/genetic- 
disease 

Wu, Y. et al. Highly efficient therapeutic gene editing of human hematopoietic stem cells. 
Nat. Med. 25, 776-783 (2019). 

Jaiswal, S. et al. Clonal hematopoiesis and risk of atherosclerotic cardiovascular disease. 
N. Engl. J. Med. 377, 111-121 (2017). 

Lee, M. H. et al. Somatic APP gene recombination in Alzheimer’s disease and normal 
neurons. Nature 563, 639-645 (2018). 

H3Africa Consortium. Enabling the genomic revolution in Africa. Science 344, 1346-1348 
(2014). 

Dolman, L. et al. ClinGen advancing genomic data-sharing standards as a GA4GH driver 
project. Hum. Mutat. 39, 1686-1689 (2018). 

Fairfax, B. P. et al. Innate immune activity conditions the effect of regulatory variants upon 
monocyte gene expression. Science 343, 1246949 (2014). 


162. Nelson, M. R. et al. The support of human genetic evidence for approved drug 
indications. Nat. Genet. 47, 856-860 (2015). 

Aninfluential paper that demonstrates that the probability of therapeutic success is 
enhanced (by around a factor of 2) for drug targets with supportive evidence from 
human genetics. 

63. Veitch, D. P. et al. Understanding disease progression and improving Alzheimer’s disease 
clinical trials: recent highlights from the Alzheimer’s Disease Neuroimaging Initiative. 
Alzheimers Dement. 15, 106-152 (2019). 

64. Ashley, E. A. Towards precision medicine. Nat. Rev. Genet. 17, 507-522 (2016). 

65. Franks, P. W. & McCarthy, M. |. Exposing the exposures responsible for type 2 diabetes 
and obesity. Science 354, 69-73 (2016). 

66. Sanna, S. et al. Causal relationships among the gut microbiome, short-chain fatty acids 
and metabolic diseases. Nat. Genet. 51, 600-605 (2019). 

67. Udler, M. S. et al. Type 2 diabetes genetic loci informed by multi-trait associations point to 
disease mechanisms and subtypes: a soft clustering analysis. PLoS Med. 15, e1002654 
(2018). 

68. Rose, G. Sick individuals and sick populations. Int. J. Epidemiol. 14, 32-38 (1985). 

69. Hindorff, L. A. et al. Prioritizing diversity in human genomics research. Nat. Rev. Genet. 19, 
175-185 (2018). 

70. Wittrup, A. & Lieberman, J. Knocking down disease: a progress report on siRNA 
therapeutics. Nat. Rev. Genet. 16, 543-552 (2015). 

71. Schuster, S. J. et al. Chimeric antigen receptor T cells in refractory B-cell lymphomas. 

N. Engl. J. Med. 377, 2545-2554 (2017). 

72. Ding, Q. et al. Permanent alteration of PCSKQ with in vivo CRISPR-Cas9 genome editing. 
Circ. Res. 115, 488-492 (2014). 

73. Risca, V. |. & Greenleaf, W. J. Unraveling the 3D genome: genomics tools for multiscale 
exploration. Trends Genet. 31, 357-372 (2015). 


Acknowledgements We acknowledge grant funding from the following funders. J.H.C.: NIH 
(U01DK062429, U01DK062422 and RO1DK106593); N.J.C.: NIH (UOIHGOO9086, U54MD010722 
and RO1MH113362); E.1.D.: Swiss National Science Foundation, Louis Jeantet Foundation; 
E.E.K.: NIH (RO1HGO10297, RO1HL104608, U01HGO09610, RO1DK110113, UOIHGO09080, 
X01HL1345 and UM1HGOO88001); C.M.L.: NIHR Oxford Biomedical Research Centre and NIH 
(5P50HDO28138-27); K.N.N.: NHMRC (APP1113531); S.E.P.: NIH (UO1IHGO06485 and 
1U41HGO09649); C.N.R.: NIH Intramural Program at the Center for Research on Genomics and 
Global Health and National Human Genome Research Institute; and M.1.M.: Wellcome 
(090532, 098381, 106130, 203141 and 212259) and NIH (U01-DK105535, DKO85545 and 
DK098032). Personal funding comes from the following sources. M.C.: Next Generation Fund 
at the Broad Institute of MIT and Harvard; J.H.C.: Sanford Grossman Charitable Trust and 
Helmsley Charitable Trust; R.C.: UKBiobank; C.M.L.: Lika Shing Foundation; J.S.: Howard 
Hughes Medical Institute; and M.1.M.: Wellcome and NIHR. 


Author contributions M.C. and M.I.M. coordinated the drafting of the Review. All other authors 
contributed to drafting of the text and approved the final version. 


Competing interests R.C. has received research grants from British Heart Foundation, Cancer 
Research UK, Medical Research Council, Merck & Co, UKBiobank, Wellcome and Medco; a 
Pfizer Prize Award (to NDPH) and is named on a patent for a statin-related myopathy genetic 
test. R.C. receives no personal remuneration from these: any share of royalties or other 
payments have been waived in favour of NDPH. E.T.D. is chairman and board member of 
Hybridstat and on the advisory board of DNAnexus. M.E.H. is a co-founder, shareholder and 
director of Congenica. S.K. has received research grants from Bayer and Novartis; is on 
Scientific Advisory Boards of Regeneron Genetics Center, Corvidia Therapeutics and Maze 
Therapeutics; has equity in San Therapeutics, Catabasis, Verve and Maze Therapeutics and is a 
consultant for Maze Therapeutics, Alynlam, ExpertConnect, Leerink Partners, Noble Insights, 
Bayer and Novo Ventures. E.E.K. receives honoraria from Illumina and Regeneron 
Pharmaceuticals. C.M.L. has research collaborations with Novo Nordisk and Bayer, receiving 
no personal payment. D.G.M. is co-founder and shareholder of Goldfinch Bio and has received 
research funding from AbbVie, Biogen, BioMarin, Merck, Pfizer and Sanofi-Genzyme. S.E.P. is a 
member of the Scientific Advisory Board of Baylor Genetics. J.S. is a member of the Scientific 
Advisory Board for Maze Therapeutics, Camp4 Therapeutics, Nanostring, Phase Genomics, 
Adaptive Biotechnology and Stratos Genomics, a founder of Phase Genomics and a consultant 
for Guardant Health. M.I.M. was a member of advisory panels for Pfizer, NovoNordisk and Zoe 
Global; received honoraria from Merck, Pfizer, NovoNordisk and Eli Lilly and research funding 
from Abbvie, Astra Zeneca, Boehringer Ingelheim, Eli Lilly, Janssen, Merck, NovoNordisk, 
Pfizer, Roche, Sanofi Aventis, Servier and Takeda. As of June 2019, M.I.M. is an employee of 
Genentech and a holder of Roche stock. The views expressed in this article are those of the 
author(s) and not necessarily those of the NHS, the NIHR, or the Department of Health. 


Additional information 

Correspondence and requests for materials should be addressed to M.1.M. 

Peer review information Nature thanks Ewan Birney, Jeff Schloss, Richard Trembath and the 
other, anonymous, reviewer(s) for their contribution to the peer review of this work. 
Reprints and permissions information is available at http://www.nature.com/reprints. 
Publisher's note Springer Nature remains neutral with regard to jurisdictional claims in 
published maps and institutional affiliations. 


© Springer Nature Limited 2020 


Nature | Vol577 | 9 January 2020 | 189 


Article 


Arepeating fast radio burst source localized 
to anearby spiral galaxy 


https://doi.org/10.1038/s41586-019-1866-z B. Marcote’, K. Nimmo2%, J. W. T. Hessels”**, S. P. Tendulkar*®, C. G. Bassa?, Z. Paragi’, 

A. Keimpema’, M. Bhardwaj**, R. Karuppusamy*, V. M. Kaspi**, C. J. Law’, D. Michilli**, 

K. Aggarwal®®, B. Andersen**, A. M. Archibald*"°, K. Bandura", G. C. Bower”, P. J. Boyle*®, 
C. Brar*®, S. Burke-Spolaor®®, B. J. Butler", T. Cassanelli*”, P. Chawla**, P. Demorest”, 

M. Dobbs**, E. Fonseca**, U. Giri”, D. C. Good", K. Gourdji®, A. Josephy**, 

A. Yu. Kirichenko”®”°, F. Kirsten”, T. L. Landecker”, D. Lang”, T. J. W. Lazio”’, D. Z. Li2*”°, 
H.-H. Lin”®, J. D. Linford", K. Masui?©’, J. Mena-Parra”°, A. Naidu**, C. Ng", C. Patel*®, 

U.-L. Pen”"*”5”8, Z, Pleunis**, M. Rafiei-Ravandi’®, M. Rahman”, A. Renard“, P. Scholz’*””, 


S.R. Siegel**, K. M. Smith”, I. H. Stairs", K. Vanderlinde™”® & A. V. Zwaniga*® 


Received: 17 October 2019 


Accepted: 8 November 2019 


Published online: 6 January 2020 


Fast radio bursts (FRBs) are brief, bright, extragalactic radio flashes’. Their physical 
origin remains unknown, but dozens of possible models have been postulated*. Some 
FRB sources exhibit repeat bursts*”. Although over a hundred FRB sources have been 
discovered®, only four have been localized and associated with a host galaxy’ “, and 
just one of these four is known to emit repeating FRBs”. The properties of the host 
galaxies, and the local environments of FRBs, could provide important clues about 
their physical origins. The first known repeating FRB, however, was localized to alow- 
metallicity, irregular dwarf galaxy, and the apparently non-repeating sources were 
localized to higher-metallicity, massive elliptical or star-forming galaxies, suggesting 
that perhaps the repeating and apparently non-repeating sources could have distinct 
physical origins. Here we report the precise localization of a second repeating FRB 
source’, FRB 180916.J0158+65, to a star-forming region ina nearby (redshift 0.0337 + 
0.0002) massive spiral galaxy, whose properties and proximity distinguish it from all 
known hosts. The lack of both a comparably luminous persistent radio counterpart 
and ahigh Faraday rotation measure’ further distinguish the local environment of 
FRB 180916.J0158+65 from that of the single previously localized repeating FRB 
source, FRB 121102. This suggests that repeating FRBs may have a wide range of 
luminosities, and originate from diverse host galaxies and local environments. 


The CHIME/FRB Collaboration is discovering repeating FRB sources”®, 
which allows subsequent targeted observations using distributed radio 
telescope arrays to obtain precise interferometric localizations. CHIME/ 
FRB discovered® the repeating source FRB 180916.JO158 + 65, which we 
observed with the very long baseline interferometry (VLBI) technique 
atacentral frequency of 1.7 GHz and bandwidth of 128 MHz using eight 
radio telescopes of the European VLBI Network (EVN) for 5.5 hon 19 
June 2019. As described in the Methods, we simultaneously recorded 


both EVN single-dish raw voltage data as well as high-time-resolution 
intensity data using the PSRIX data recorder” in filterbank mode at the 
100-m Effelsberg telescope. 

In a search of the PSRIX data, we detected four bursts from FRB 
180916.J0158+65 with signal-to-noise ratios S/N between 9.5 and 46. 
The observed dispersion measures (DM) of the bursts are consistent 
with those previously reported’® for this source. The burst properties, 
as derived from the PSRIX data, are listed in Extended Data Table 1. 
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Fig. 1| Burst detections in Effelsberg auto-correlation data. Band-averaged 
profiles and dynamic spectra of the four bursts, as detected using the 
coherently dedispersed Effelsberg auto-correlation data (a,b,c andd). Each 
burst was fitted with a Gaussian distribution to determine the full-width at half- 
maximum (FWHM) durations, which are represented by the dark cyan bars. The 
lighter cyan encloses the 2o region. Bursts B3 and B4 (cand d) show sub-bursts; 
the orange, purple and green bars correspond to the FWHM of these sub- 
bursts, andthe lighter bar (of each colour) encloses the 20 region. Note the 


No other dispersed single pulses of plausible astrophysical origin were 
found in this search, for DMs in the range 0-700 pccm °. Using the EVN 
raw voltage data, we generated high-time-resolution (16-1s samples) 
Effelsberg auto-correlation data containing each burst. We used coher- 
ent dedispersion to mitigate the intra-channel smearing that dominated 
over the temporal resolution inthe PSRIX data. By minimizing disper- 
sion broadening, we properly resolve the burst structure, and find a 
best-fit DM=348.76 + 0.10 pccm “using the brightest burst (Methods). 
We also detect brightness modulation witha characteristic frequency 
scale of 59 +13 kHz, which we interpret as scintillation imparted by the 
ionized interstellar medium of the Milky Way (Methods). The properties 
of the four bursts, as seen in the Effelsberg auto-correlations, are also 
shown in Extended Data Table 1. The frequency-averaged burst profiles 
and dedispersed dynamic spectra are shown in Fig. land Extended Data 
Fig. 1 for the Effelsberg auto-correlation and PSRIX data, respectively. 

We then used the EVN raw voltage data to create coherently dedis- 
persed cross-correlations, also knownas visibilities, at the times of the 
four bursts. Radio interferometric images with milli-arcsecond resolu- 
tion were produced for each individual burst, using a single 0.7-2.7-ms 


B2 
0.5 MHz 
0.400 ms 


-1.5 0 
Time (ms) 


1.5 3 


different time windows plotted: Bland B3 (aandc) show 12 ms, whereas B2 and 
B4 (band d) show 6 ms surrounding the burst peak. The solid white lines 
represent frequency channels that have been removed from the data due to 
either radio frequency interference (RFI) or subband edges, indicated by the 
red and blue markers, respectively. The burst data have been downsampled by 
various factors in time and frequency to optimize the visual representation. 
The RFl excision was done before downsampling. The time and frequency 
resolution used for plotting is shown inthe top right of each panel. 


integration, depending onthe burst duration. Each image shows emis- 
sion above at least seven times the root-mean-square (r.m.s.) noise 
level (Fig. 2; see Methods for a detailed explanation of the calibration 
and imaging process). The four burst images provide an averageJ2000 
position for FRB180916.JO158+65 of a=01h58 min 00.75017 s + 2.3 mas, 
6=65° 43’ 00.3152” + 2.3 mas. Visibilities with 2-s time resolution were 
also generated for the entire observation span. We used these to produce 
animage of the field around FRB 180916.J0158+65 in order to search for 
apersistent radio counterpart, like that seenin the case of FRB121102"™. 
No such emission is detected above a 3or.m.s. noise level of 30 py per 
beam. Independent observations using the Karl G. Jansky Very Large 
Array (VLA) at 1.6 GHz detect no coincident emission above a3or.m.s. 
noise level of 18 Jy per beam (see Methods and Extended Data Fig. 2). 

The precise EVN position shows that FRB 180916.J0158+65 is spatially 
coincident with a galaxy catalogued in the Sloan Digital Sky Survey 
as SDSS J015800.28+654253.0. Given the maximum expected redshift 
of approximately 0.11, as inferred by the measured DM (Methods), we 
find that the probability for chance coincidence is less than 1% for any 
type of galaxy with mass greater than about 40% that of the FRB 121102 
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Fig. 2|_EVNimages and burst positions. Each individual burst is imaged by 
using the EVN gated visibility data (a, b, cand d). e, Image using the full span of 
the EVN observation and 2-s integration time; no significant (>30) persistent 
radio emission is detected at the positions of the bursts (denoted by the red 
cross) and no signal above the 4a level is detected anywhere within the full 
field. Contours start at three times the r.m.s. noise level of each image (75 mJy 
per beam, 65 mJy per beam, 43 mJy per beam and100 mJy per beam and 9.7 pJy 
per beam, respectively) and increase by factors of /2. For all images, the 
synthesized beam is represented by the grey ellipse in the bottom-left corner. 
The Tianmastation was only included in the derivation of the continuum 
image (e). f, The positions derived from each individual burst and their 
associated louncertainties (orange), with respect to the weighted-average 
burst position (black). All positions are referenced to right ascension 
a(J2000) = 01h58 min 00.75017 s, declination 6 (J2000) =65° 43’ 00.3152”. 


host (Methods). Moreover, we thus conclude that the association of 
FRB 180916.J0158+65 to this particular galaxy is well established. FRB 
180916.J0158+65 is close to the plane of the Milky Way, with Galactic 
longitude /=129.7° and latitude b =3.7°. Its low DM excess compared to 
the Milky Way contribution (Methods) brought into question whether 
it could possibly be a Galactic disk or halo object®, but the host galaxy 
association shows that it is clearly extragalactic. 

We used the 8-m Gemini-North telescope to characterize the morphol- 
ogy of the host galaxy and to measure a spectroscopic redshift (Meth- 
ods). Deep optical imaging reveals that the host is a nearly face-on spiral 
galaxy (Fig. 3) with a total stellar mass of approximately 10” times that 
of the Sun (Methods), which is comparable to the total stellar mass of 
the Milky Way. No other comparably large and bright galaxy is visible in 
the broader field covered by Gemini-North (Extended Data Fig. 3). The 
milli-arcsecond precision of the EVN localization shows that the FRB 
source is close to a bright feature in the ”-band approximately 7 arcsec 
(projected separation of roughly 4.7 kpc) from the core of the host galaxy. 
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With Gemini-North long-slit spectroscopy, we simultaneously tar- 
geted the host galaxy centre and the offset position of FRB 180916. 
JO158+65 (Extended Data Fig. 3). This revealed strong Ha emission, and 
several other spectral lines frequently associated with star formation 
(Fig. 3). By measuring optical line ratios, we confirm that the host is 
indeed a star-forming galaxy (Extended Data Fig. 5). 

Comparing with the rest frequencies of the lines, we find a redshift 
Z= 0.0337 + 0.0002, which corresponds to a luminosity distance of 
149.0 + 0.9 Mpc (or an angular size distance of 139.4 + 0.8 Mpc) using 
standard cosmological parameters”®. FRB 180916.J0158+65 is thus 
the closest-known FRB source with a robust host galaxy identification 
and measured redshift. It is a factor of six closer than the repeater FRB 
121102”, and more than an order-of-magnitude closer than the (thus 
far) non-repeating sources FRB 180924”, FRB 181112” and FRB 190523”. 

The optical spectrum at the location of FRB 180916.J0158+65 shows 
that both the associated star-forming clump and the host galaxy core 
are at the same redshift (Fig. 3). However, the spectrum at the location 
of FRB 180916.J0158+65 is dominated by emission from the clump, 
whichis offset from the centre of the slit. Hence we have no independent 
direct estimate of the emission or dispersion measure at the position 
of FRB 180916.J0158+65 from the Gemini-North data. By considering 
the various modelled foreground contributions to DM, we estimate 
that the host contribution is less than approximately 70 pc cm°, and 
could be substantially smaller (Methods). 

FRB 180916.JO158+65 is located at the apex of the v-shaped star-form- 
ing clump (Extended Data Fig. 4) witha relatively large’’ projected size of 
roughly 1.5 kpc and astar-formation rate of greater than about 0.016M, 
yr (and a star-formation surface density of about 107M, yr kpc”). 
Thev-shaped star-forming clump is a remarkable feature of the galaxy, 
suggesting the possibility that the region has undergone aninteraction 
that triggered the star formation, either between multiple star-forming 
regions or conceivably involving a putative dwarf satellite companion. 

The spiral host galaxy of FRB 180916.J0158+65 contains over 100 
times more stellar mass and has a metallicity five times higher (see 
Extended Data Fig. 5 and Methods) than the dwarf host galaxy of FRB 
121102. The discovery of this host demonstrates that some FRB sources 
exist in galaxies more similar to our own Milky Way. Previously, it has 
been noted” that the host of FRB 121102 is similar to the type of 
low-metallicity galaxies with high specific star-formation rate that are 
associated with hydrogen-poor superluminous supernovae and long- 
duration gamma-ray bursts. In contrast, the FRB 180916.J0158+65 host 
is unlike such galaxies; this weakens the case for a general link between 
all repeating FRB sources and these extreme astrophysical explosions. 
Inasearch of the Open Supernova Catalog”’, we find no previous super- 
novae or gamma-ray bursts at the location of FRB 180916.J0158+65. 

The proximity of FRB 180916.J0158+65 constrains the presence of 
any persistent radio counterpart to a luminosity (at a30 confidence 
level) of vL, <1.3 x 10* erg s ‘from the continuum EVN data (sensitive 
to milli-arcsecond scales) and to vL,<7.6 x10” erg s ‘fromthe VLA data 
(sensitive to emission on arcsecond scales; see Methods). Compared 
tothe persistent source associated with the repeating FRB 121102, this 
upper limit implies that any such source associated with FRB 180916. 
JO158+65 must be at least 400 times fainter than the one associated with 
FRB 121102“. The previously determined® Faraday rotation measure 
(RM) of FRB180916.J0158+65, RM =-114.6 + 0.6 rad m ~~, is three orders 
of magnitude lower than that of FRB 121102”, where RM = 10° rad m~. 
As previously suggested®, we conclude that FRB 180916.JO158+65 is 
located ina much less extreme local environment compared with FRB 
121102, and that the physical mechanism for FRB repetition does not 
depend onsuchconditions. Nonetheless, models originally proposed 
for FRB 121102””—in which the bursts originate from a young and 
rapidly rotating magnetar—could potentially still explain the observed 
properties of FRB 180916.J0158+65 by invoking an age of approximately 
300 yr, which is ten times older than that proposed for FRB 121102 
(see Methods for a brief comparison to existing models). 
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Fig. 3 | Gemini-North host galaxy image and optical spectrum. a, Image of the 
host galaxy using ther’ filter. The position of FRB180916.JO158+65 is marked. 
The inset shows a higher-contrast zoom-in of the star-forming region 
containing FRB 180916.J0158+65 (marked by the red circle). The uncertainty in 
the position of FRB180916.JO158+65 is smaller than the resolution of the 
image. b, Sky-subtracted spectrum extracted froma 5-arcsec aperture around 


Although the host galaxies of FRB 180916.J0158+65 and FRB 121102 
are markedly different, both sources are located near or within a star- 
forming region in the host galaxy. This contrasts with the elliptical 
host galaxies of FRB 180924” and FRB 190523”, where there is com- 
paratively little active star formation, but may be consistent with the 
star-forming galaxy of FRB 181112”. This diversity in hosts and local 
environments allows for the possibility that repeating and apparently 
non-repeating FRB sources have physically distinct origins. However, 
comparison of FRB event rates with those of proposed progenitors 
disfavours models that invoke cataclysmic explosions and suggests 


the host galaxy core (blue) and a 2-arcsec aperture around the location of FRB 
180916.JO158+65 (orange, scaled bya factor of five for clarity). Emission lines 
are identified along with their rest-frame wavelengths in air. Owing to the 


complicated shape of the galaxy, the flux densities, F,, have not been corrected 
for slit losses. 


that a large fraction of sources must be capable of repeating’. The 
recent finding that FRB 171019 produces repeat bursts that are almost 
600 times fainter than the originally discovered signal’ underscores 
the fact that the detectability of repetition depends on instrumental 
sensitivity and source proximity. If FRB 180916.J0158+65 were at the 
distance of the other well localized FRBs, only a small fraction of its 
(brightest) bursts would be visible. 

Furthermore, it has been proposed that a young magnetar origin for 
the bursts of as-yet non-repeating FRBs in non-star-forming regions is 
still viable as long as it is possible to form such sources through a variety 
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Table 1| Burst properties 


Burst WW, (ms) W,uoi(ms) — W,,42 (ms) W,,»3(ms)  Fluence(Jyms)* Spectral energy density (10% ergHz")* Aa(mas) Ad(mas) §, (Jy)? 

B1 1.864013 - - - 0.72 1.90 -2143.2 1143.2 0.74 + 0.08 
B2 0.24+0.02 - - - 0.20 0.53 1542.9 -0.2+2.9 0.66+0.05 
B3 172+014 0O14+0.02 012+0.01 - 0.62 1.64 2.84+2.8 -14+28 050+0.04 
B4 166+0.05° 0.24+0.01 0.06+0.0006 0.68+0.03 2.53 6.68 01419 03419 2.3403 


Widths, fluences and spectral energy densities are determined using the Effelsberg auto-correlation data of the bursts from FRB 180916.J0158+65, with a centre frequency of 1.7 GHz and 
dedispersed for a DM = 348.76 pc cm*®. W,, is the FWHM of the total burst envelope, and W,,,, represents the FWHM of any sub-bursts, ordered according to their arrival time (the earliest 
sub-burst is labelled ‘sub1’). The fluence is determined by integrating the burst profile over W,,. and is converted to physical units using the radiometer equation™ (see Methods). The spectral 
energy density is estimated using the measured fluence and luminosity distance. The position offsets (Aa, Ad) are referred to the average J2000 burst position of a = 01h 58 min 00.75017 s, 
6 =+65° 43' 00.3152". The flux densities S, are measured by fitting a circular Gaussian to the EVN visibility data and are averages over the gate width. 

*At 1.7-GHz reference frequency, and corresponding to the sum of all sub-bursts. A conservative fractional error of 30% is adopted for the derived fluences and energy densities. These are 


assumed to be isotropic. 


At 1.7-GHz reference frequency. The absolute flux scale may exhibit an additional ~15% uncertainty due to possible systematic gain calibration offsets. 


°See Methods for how the total burst width of B4 was determined. 


of channels, including direct stellar collapse, accretion-induced collapse, 
and through the merger of compact objects”. Ultimately, alarger number 
of precision localizations is needed before we can establish that multiple 
physical origins are required to explain the observed FRB phenomenon. 
There arenow11 repeating FRBs published‘ ’, and with precision localiza- 
tions it will be possible to establish whether repeating and apparently 
non-repeating FRB sources have demonstrably different environments. 

FRBs have now been localized with luminosity distances that span 
approximately 150 Mpc to4 Gpc. Estimating distance purely based on 
DM, it appears likely that there are FRBs that are even closer” or more 
distant®°”” compared with this range. The four bursts presented here 
have isotropic-equivalent spectral energy densities as low as approxi- 
mately 5 x 10” erg Hz‘ (Table 1). Assuming similar beaming fractions, 
this makes them close to an order of magnitude less energetic than 
the weakest bursts seen from FRB 121102 to date’’, and between four 
to six orders-of-magnitude less energetic than FRB 180924" and FRB 
190523"°. Unless multiple models are invoked, a viable model for FRBs 
must address this large range of (apparent) energy outputs. 

Comparing instead with pulsar emission, we note that the bursts from 
FRB 180916.J0158+65 are still at least a million times more energetic 
than the brightest giant pulses seen from the Crab pulsar—suggesting 
that they are not simply an exceptionally bright version of the known 
pulsar giant pulse phenomenon”. 

As previously suggested®, the proximity of FRB 180916.J0158+65 is 
an advantage for multi-wavelength follow-up of the host galaxy and 
local environment. Whereas targeted observations of FRB 121102 have 
failed to detect either prompt or persistent optical, X-ray or gamma-ray 
counterparts*” ’, similar observations towards FRB 180916.JO158+65 
may strongly constrain magnetar-based models, even in the event of 
non-detections. FRB 180916.J0158+65 is thus one of the most promising 
known sources for understanding the nature of FRBs. 
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Methods 


Apriorilocalization 

The CHIME/FRB Collaboration discovered® the repeating source FRB 
180916.J0158+65 and refined its position using the CHIME/FRB base- 
band mode. The automatic baseband triggering system of CHIME/ 
FRB* captured’ raw voltages for a burst from FRB 180916.JO158+65. The 
intensity of the signal inthe region surrounding the original detection 
was then calculated by producing a grid of tightly packed tied-array 
beams. A refined position was obtained by fitting this intensity map with 
asimple model of the telescope beam. The technique will be described 
in more detail ina future paper (D.M. et al., manuscript in preparation). 
Since the analysis strategy was still in a preliminary stage at the time, we 
estimate a systematic uncertainty on this position of roughly 3 arcmin, 
based onasmall sample of four known pulsars. CHIME/FRB baseband 
localizations are expected to have higher precision in the future. 


European VLBI Network and Effelsberg observations 

The EVN observed the field of FRB 180916.J0158+65 on 19 June 2019 
for 5.5h at a central frequency of 1.7 GHz and with a bandwidth of 
128 MHz. The phase centre was placed at the position provided by 
the initial CHIME/FRB baseband localization: a= 01h 57 min 43.2s, 
6=65° 42’ 01.02” (J2000 coordinates). 

A total of eight dishes participated in the EVN observations: the 
100-m Effelsberg, the 65-m Tianma, the 32-m Medicina, the 32-m 
Torun, the 32-m Irbene, the 25-m x 38-m Jodrell Bank Mark II, the 25-m 
Onsala, and a single 25-m dish from the Westerbork array. The data 
were streamed in real time (e-EVN setup) to the EVN Software Corre- 
lator (SFXC*) at the Joint Institute for VLBI ERIC (JIVE) in Dwingeloo, 
The Netherlands. The real-time visibility data are comprised of eight 
16-MHz subbands of 32 channels each, with full circular polarization 
products, and 2-s time averaging. 

In parallel, we buffered the individual station raw voltage data in 
order to allow high-time-resolution correlations afterwards, at the 
times of any detected bursts. This method allows one to recover the 
signal from any position within the primary beam of the antennas 
(which have a full-width at half-maximum, FWHM, of roughly 7 arcmin 
inthe case of the 100-m Effelsberg dish, and roughly 30 arcmin for the 
25-m antennas), which would otherwise be smeared due to time and 
frequency averaging if it is more than tens of arcseconds away from 
the phase centre used in cross-correlation. We observed 3C454.3 and 
JO745+1011 as fringe-finders and bandpass calibrators. JO207+6246 
was observed as phase calibrator (located 3.1° away from FRB 180916. 
JO158+65) ina phase-referencing cycle of 2 min onthe calibrator and 5 
min onthe target, FRB 180916.J0158+65. We also observed JO140+6346 
asa check source (3.1° away from the same phase calibrator) following 
the same phase-referencing cycle. 

Simultaneously, we recorded high-time-resolution filterbank data 
at 1.7 GHz using the 100-m Effelsberg telescope and the PSRIX data 
recorder”, which is designed for pulsar observations. We observed FRB 
180916.J0158+65 for a total on-source time of 3.47 hin the frequency 
range 1597-1737 MHz. This total bandwidth was divided into 144 spec- 
tral channels of 0.98 MHz each. Of these, the bottom 48 channels were 
corrupted by RFI, probably from Iridium satellites. Consequently, these 
channels were removed from the data before beginning the analysis, 
giving a usable bandwidth of 93.75 MHz. The data were recorded with 
full Stokes information and a time resolution of 81.92 ps. Before the 
beginning of the observation, we performed a short test scan on the 
known pulsar PSR B2111+46, to verify the data integrity. 


Interferometric data reduction 

The EVN data were analysed using standard procedures within the 
Astronomical Image Processing System (AIPS”) and Difmap** software 
packages. A priori amplitude calibration was performed using the 
known gain curves and system temperature measurements recorded 


at each station during the observations. Poor data, primarily due to 
RFI, were flagged manually (less than about 10% of the total data). The 
remaining data were then fringe-fitted and bandpass calibrated using 
the fringe-finders and phase calibrator, which were imaged and self- 
calibrated to improve the final calibration of the data. The obtained 
solutions were transferred to FRB 180916.J0158+65 and JO140+6346 
before creating the final images. 

The check source JO140+6346 was used to estimate both the abso- 
lute astrometric uncertainty and the potential amplitude losses that 
could have been introduced due to the phase-referencing technique. 
Although the latter accounted for less than approximately 10% of the 
total flux density scale, we found a larger-than-expected offset (about 
4 mas) of the centroid of the check source position with respect to its 
known coordinates from the International Celestial Reference Frame. 
The origin of this offset is explained by both the uncertainties asso- 
ciated with the phase-referencing technique” *! and the extended 
structure of JO207+6246, the phase calibrator. The core of this source 
shows multiple components that make the determination of the true 
position of the source ambiguous at the milli-arcsecond level. We cor- 
rected for this observed offset in all burst positions presented in this 
study, but an uncertainty on the final absolute positions of +1.7 mas and 
+2.1mas in right ascension and declination, respectively, still remains 
owing to the ambiguity in the exact position of this reference source 
and its extension (the source is resolved on milli-arcsecond scales). 


Search for FRB 180916.J0158+65 bursts 

The PSRIX data were analysed in order to search for dispersed, milli- 
second-duration bursts, using the PRESTO” software package. The 
tool rfifind was used to identify frequency channels and time samples 
contaminated by RFI. We estimate that ~4% of the data were masked 
using rfifind. Incorporating the RFI mask, the data were then inco- 
herently dedispersed to DMs in the range 0-700 pc cm” in steps of 
1 pc cm? using prepsubband, generating dedispersed time series at 
each trial DM. To search for short-duration bursts, each dedispersed 
time series was convolved with boxcar functions of various widths, 
using single_pulse_search.py. Given the frequency resolution of the 
PSRIX data, the residual dispersive delay within each channel resultsina 
temporal smearing of about 0.6 ms at the expected° DM of FRB 180916. 
J0158+65 (349 pccm”®*). This intra-channel smearing dominates over the 
sampling time, causing a loss of sensitivity to extremely narrow bursts. 
We were sensitive to bursts with widths exceeding the intra-channel 
smearing time and up to 98.3 ms. A DM-versus-time plot (known asa 
single-pulse plot, generated by single pulse search.py) was created 
for each scan and inspected by eye for candidates. For each candidate 
identified, the dynamic spectrum was generated and inspected by eye 
to distinguish between astrophysical signals and terrestrial sources of 
RFI. During our 3.47-h on-source time, we found four bursts from FRB 
180916.J0158+65 at its known DM. As a final check, each dedispersed 
time series was filtered for RFI in the search for candidates with peak 
S/N > 7, using an automated classifier that returned only the same 
four bursts found in the initial manual search. 

Using the time of arrival of each individual burst, one-second-dura- 
tion Effelsberg auto-correlations were created for each burst, using the 
recorded EVN raw-voltage data. The auto-correlation data were gener- 
ated witha higher temporal resolution (16-ps time bins) and frequency 
resolution (62.5-kHz channels) compared with the PSRIX data, and 
were coherently dedispersed to 350 pc cm®. Coherent dedispersion 
mitigates the effects of intra-channel smearing to less than 3 ps for 
DMs that are within a few units of 350 pc cm *. This provided us with 
the opportunity to better study the burst structure (limited only by our 
temporal resolution and the S/N), whichis unresolved in the PSRIX data. 

We identified the four bursts from FRB 180916.J0158+65 in the 
auto-correlations (Fig. 1 shows their band-averaged burst profiles 
and dynamic spectra). For comparison, we also show the profiles and 
dynamic spectra for the bursts in the PSRIX data (Extended Data Fig. 1). 


Article 


Burst properties 

Using the brightest burst, labelled B4, we determined a precise DM for 
FRB180916.JO158+65. We used the PSRCHIVE* tool pdmp to determine 
the DM that corresponds to a peak in the S/N. Because B4 has a bright 
and narrow sub-burst, optimizing the $/Nis probably also equivalent 
to optimizing the structure of the burst”, in this case. We searched 
DMs of 350 +3 pc cm in steps of 0.01 pc cm™ using pdmp, and found 
the optimal DM to be 348.76 + 0.10 pc cm ° (consistent with the DM of 
349.2 + 0.3 pc cm? determined for this source by CHIME/FRB°). The 
bursts shown in both Fig. 1 and Extended Data Fig. 1 are dedispersed 
to this DM value. 

Table 1 lists the physical properties of the bursts using the Effels- 
berg auto-correlations, dedispersed to 348.76 pc cm™*. We measured 
the burst FWHM duration and peak time using a Gaussian fit*®. The 
brightest burst, B4, shows a complicated structure (as can be seen in 
Fig. 1). The width of each individual sub-burst in B4 was determined 
using the same Gaussian fit as above. The total width of B4, however, 
was measured by defining the beginning of the burst as the peak time 
of the first sub-burst minus half W,,,,,, and the end of the burst as the 
peak time of the last sub-burst plus half W,,,,3. 

To determine the fluence, we define a larger time window, determined 
by eye, than the quoted FWHM durations, which encompasses all of 
the burst flux. First, we produce atime series containing each burst by 
summing over all frequency channels. We define an off-pulse region, 
and use this to normalize the time series. We then convert the signal 
in each time bin to physical units by using the radiometer equation™, 
and sum over the burst duration to estimate the fluence. It should be 
noted that B1 and B3 were downsampled by a factor of four for this 
analysis, resulting in a lower time resolution than the other bursts. 
For the radiometer equation, we take typical values for the system 
temperature T,,,~ 20 K and telescope gain G~1.54 K Jy for Effelsberg. 
For the fluence values quoted in Table 1, we provide a conservative frac- 
tional error of 30%. Also shown in Table 1 are the burst spectral energy 
densities”, using the derived burst fluences and luminosity distance 
to FRB 180916.J0158+65. All four bursts are much dimmer than those 
previously detected® with CHIME/FRB, whichis unsurprising given the 
higher sensitivity of the 100-m Effelsberg telescope. 

Extended Data Table 1 shows the detection properties of the bursts, 
as seen in both the PSRIX data and the Effelsberg auto-correlations. For 
the PSRIX data, we quote the maximum S/N detected using PRESTO’s 
single pulse _search.py with an associated DM. The burst S/N for the 
auto-correlations is that measured at DM = 348.76 pccm ®, again using 
single pulse _search.py. The burst width in Extended Data Table 1is the 
boxcar width used in single pulse search.py resulting in the peak S/N 
value. The S/N of the four bursts, as discovered in the PSRIX data, are 
shown in Extended Data Fig. 6 as a function of time during our observa- 
tion. The first three bursts appear clustered in time, occurring within 
1.5h of the beginning of our observation. 

To study frequency-dependent brightness variations in the domi- 
nant sub-burst of B4—which are expected if there is scintillation—we 
conducted a standard auto-correlation analysis*®. We generated the 
spectrum of the burst within the FWHM duration (W,,,,. = 0.06 ms) 
using Effelsberg auto-correlations with a frequency resolution of 15.6 
kHz. Spectral channels containing substantial RFI, along with those 
at the edges of the 16-MHz subbands, were set to zero in the spectrum 
before generating the auto-correlation function (ACF). 

We calculated the ACF using the following expression: 


Y; (S(v) — S)(SQ;,+ Av) - 5) 


ACF(Av) = 
te JX (Sly) - 5)? |; (y+ Av) - S$)? 


(1) 


where vis the frequency, Avis the frequency lag, and S is the mean of 
the spectrum. In this expression each summation is over the indices i 
which give non-zero values for both S(v,) and S(v;+ Av) (thatis, we only 


include the ‘good’ data in the auto-correlation analysis). The ACF cal- 
culated using equation (1) is shown in Extended Data Fig. 7. 

The central part of the ACF (for lags in the range -1.016 to +1.016 
MHz) after removing the zero lag was then fitted with a Lorentzian 
function® of the form: 


a 
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using a least-squares fit, for free parameters: amplitude a; the decor- 
relation bandwidth (also known as the scintillation bandwidth) Av,, 
defined** as the half-width at half-maximum of the ACF); and offset b. 
Extended Data Fig. 7 shows the fit to the ACF. The Lorentzian fit, with 
127 degrees of freedom, returned a x’ of 105.2. We assume that the ACF 
between 3 MHzand 20 MHzlags contains no structure and represents 
the noise in the central profile of the ACF. We measure the standard 
deviation in this region and use this as our uncertainty for the calcula- 
tion of x. The P value of the fit is 0.92. 

We measure the scintillation bandwidth to be 59 + 13 kHz. This esti- 
mate is consistent with the NE2001 model® prediction of about 60 kHz 
along the line-of-sight at our observing frequency of 1.7 GHz. Using 
the simple relationship 


2nTAvy=1 (3) 


we estimate a scattering timescale t of about 2.7 ps, which we find is 
also similar to the NE2001 model prediction of around 2 ps along the 
line of sight®°. This constraint on the scattering time is much tighter 
compared to CHIME/FRB measurements? of FRB 180916.J0158+65 
bursts, even after accounting for the different observing frequencies 
and the typical v“ scaling of scattering time. CHIME/FRB nominally 
measured scattering times of approximately 2 ms for two FRB 180916. 
JO158+65 bursts, but these measurements are probably a reflection of 
burst morphology as opposed to genuine scattering effects (as was 
previously considered®). 


High-time-resolution correlation and burst imaging 

Coherently dedispersed visibilities for each burst were created by re- 
correlating the EVN data for only the time ranges corresponding to 
the burst durations. The exact time ranges were initially determined 
from the Effelsberg PSRIX data and then refined using coherently 
dedispersed Effelsberg auto-correlations. From the resulting auto- 
correlations we created pulse profiles and used these to set the optimal 
correlator gates applied to the interferometric data. 

The a priori position of FRB 180916.J0158+65 provided at the time 
of the EVN observation had an uncertainty of about 3 arcmin (see 
above), making it infeasible to localize the bursts by directly imaging 
this region at milli-arcsecond resolution (which would require an image 
with around 10” pixels). However, for strong signals, it is possible to 
estimate the position of the source from the geometric delays across the 
different telescope baselines. The differential geometric phase delay 
A@, of the source with respect to the phase centre (a priori pointing 
position) for a given baseline is given by”: 


Ad, = 2mv(uAacosé + vA6) (4) 


where vis the observed frequency, (Aa, A6) is the positional offset of 
the source with respect to the phase centre in right ascension (a) and 
declination (6), respectively, and (u, v) are the coordinate offsets for 
the given baseline. 

We used the strongest burst, B4, to estimate the offset of the source 
with respect to the phase centre. We then re-correlated the data at 
this refined J2000 position of a=01h58 min 00.5 s, 6 = 65°43’ 01.0” 
(with an estimated uncertainty of a few arcseconds), which was around 
2 arcmin from the a priori position—and thus consistent within the 


estimated uncertainties. We applied the previously described EVN 
calibration to these burst data and imaged the individual bursts. All 
bursts were detected above a 7o confidence level. Different imaging 
schemes were used during the imaging process in order to emphasize 
resolution (that is, contribution from the longest baselines) or sensitiv- 
ity (that is, contribution from the core of the array). The final images 
were produced by using a natural weighting and excluding data from 
the Tianma station which, despite providing the highest resolution, 
exhibited potential calibration issues that could affect the absolute 
burst positions by a few milli-arcseconds—especially for such short 
integration times, where the wv coverage is poor. Table 1 lists the burst 
properties obtained from the EVN images, which are shown in Fig. 2. An 
image of the combined burst data was also produced. This final image 
is dominated by the emission of the strongest burst, B4, and thus we 
only consider the results from the four bursts individually. 

The flux densities and positions were measured using Difmap and the 
Common Astronomy Software Applications (CASA) software packages 
by fitting a circular Gaussian distribution component to the detected 
emission in the wv plane, which is expected to be a robust method 
against station gain calibration uncertainties*’. The position of each 
burst was measured independently and then we determined a weighted 
averageJ2000 position for the source of a=01h58 min 00.75017s + 2.3 
mas, 6= 65° 43’ 00.3152” + 2.3 mas, where we used the fluence of each 
burst as weights. This is within 1.7 arcsec of the position determined 
via mapping the geometric delays. The final positional uncertainty 
reflects the statistical uncertainties from the individual position meas- 
urements (about 1 mas), the uncertainties in the absolute International 
Celestial Reference Frame position of the phase calibrator (JO207+6246; 
0.15 mas) and check source (J0140+6346; 0.1mas), and the systematic 
uncertainty associated with the phase-referencing technique and the 
extended structure of the check source and phase calibrator (around 2 
mas), as explained previously. Uncertainties were added in quadrature. 

Finally, we searched for continuum radio counterparts of FRB 180916. 
JO158+65 by imaging the full interferometric dataset with a field-of- 
view of 2 arcsec x 2 arcsec centred at the position of the bursts (and 
thus also covering the full extent of the nearby star-forming region). 
We detected no emission at the position of FRB 180916.J0158+65 as 
well as no significant (>4.50) compact emission anywhere in the field 
above the r.m.s. noise level of 10 py per beam (19 wy per beam when 
excluding the data from the Tianma station). We also searched for pos- 
sible emission from the core of the galaxy, but found no signal above 
the 6or.m.s. noise level ina 4 arcsec x 4 arcsec area. 


VLA observations 

InJune 2019, we used the VLA to perform a quasi-simultaneous search 
for millisecond transients and faint persistent emission associated 
with FRB 180916.JO0158+65. The VLA observed from 1 GHz to 2 GHz 
with baseline lengths up to 11 km, giving a synthesized beam size of 
roughly 4 arcsec. VLA visibilities were sampled at 5-ms time resolu- 
tion and searched in real time by the Realfast system*; the same data 
were integrated at 3-s time resolution and calibrated with the standard 
CASA calibration pipeline. Ina total of 14 h of observing (with on-target 
efficiency of 80%), we found no millisecond transients to an 8a limit 
of approximately 50 mJy per beam in 5 ms or, equivalently, a fluence 
limit of 0.25 Jy ms. No VLA observing was coincident with known burst 
times detected by the EVN. We concatenated all 14 h of data, applied an 
iteration of phase self-calibration using relatively bright background 
sources in the field of view of the antennas, and imaged 600 MHz of the 
bandwidth centred near 1.6 GHz to search for persistent radio emission 
on arcsecond scales. We confirmed that the astrometric accuracy of 
the deep image is good to better than 1 arcsec by associating seven 
bright field radio sources to the catalogued location in the NRAO VLA 
Sky Survey®. No persistent emission is detected at the position of FRB 
180916.J0158+65 brighter than a 30 limit of 18 py per beam. A 25-Wy 
source is located at a nearby J2000 position of a=01h58 min 00.9 s, 


6=65° 43’ 07” (see Extended Data Fig. 2), but this position is offset by 
approximately 7 arcsec from the location of FRB 180916.J0158+65, 
which is much more than the astrometric precision, estimated to be 
0.4 arcsec (the VLA observed in its B configuration at the time). It is 
thus very unlikely to be associated with FRB 180916.J0158+65 or the 
host galaxy. 


Optical observations and host galaxy 

The EVN localization of FRB180916.J0158+65 places it at the outskirts of 
SDSSJ015800.28+654253.0, an apparent Sb spiral galaxy (possibly with 
afaint bar) with an estimated” photometric redshift z,,,,.= 0.07 + 0.05. 

We obtained imaging and spectroscopic observations of the host 
galaxy using the GMOS spectrograph on the Gemini-North telescope 
between July and September 2019 (programme ID GN-2019A-DD-110). 
We acquired 2 x 900-s exposures with each of the g’ and r filters. The 
images were processed with the Gemini IRAF/pyraf package, and 
sources were extracted using SExtractor®*. The images were de-warped 
and astrometrically corrected by matching the point source positions 
to those from the Gaia Data Release 2 (DR2) Catalogue’. The matching 
precision was 17-20 mas r.m.s. for each image. The seeing was measured 
to be 0.99 and 0.78 arcsec in the g’ andr’ images, respectively. 

Figure 3 shows the ’’ image of the galaxy, along with the position 
of FRB 180916.J0158+65. The galaxy is nearly face-on, with spiral arm 
structures visible. FRB 180916.J0158+65 appears to be co-located with 
one of the outer arms of the galaxy, near a bright clump inthe?’ image 
(see Extended Data Fig. 4 for amore detailed zoom on this region). 
We attribute this emission to that of Ha, whichis bright in emission in 
the GMOS spectra and, at the redshift of the host galaxy, falls within 
ther’ image. The total stellar mass of the galaxy has been estimated to 
be about 10” times the mass of the Sun by using the WISE W1 and W2 
colours”, which provide amore robust estimation given that they are 
less affected by extinction. 

We performed long-slit optical spectroscopy, simultaneously tar- 
geting the galaxy core and the FRB location, in order to measure the 
redshift and spectral properties of the galaxy and the location of FRB 
180916.J0158+65. We acquired 4 x 900-s spectroscopic observations 
with a1.5-arcsec slit and the R400 grating—along with corresponding 
flat fields, copper-argon arcs, and observations of BD +28 4211 to cor- 
rect for sensitivity and extinction. As shown in Extended Data Fig. 4, the 
slit was rotated to align with the host galaxy centre and FRB position. 

The spectroscopic data were processed with the Gemini IRAF/pyraf 
package. The FWHM of the emission lines from the sky was measured 
to be 7-8 A, matching the theoretical resolving power of R= 640. The 
wavelength calibration with copper-argon emission lines had a preci- 
sion of 0.7-0.8 A. However, the lack of arc lines towards the bluer edge 
reduces the accuracy of the wavelength solution towards 5,200 A. 

We extracted the spectrum within 5 arcsec of the galaxy centre as well 
as 2 arcsec around the location of FRB 180916.J0158+65 itself. We used 
the trace of the galaxy centre, offset by 7.7 arcsec to trace the location 
of FRB180916.J0158+65. Figure 3 shows the calibrated spectra at each 
location separately. We identify redshifted Ha, NII, S 11and O11 lines 
with an average redshift of z= 0.033 + 0.001 if all lines are included and 
z= 0.0337 + 0.0002 when excluding the O 111 lines that are at the blue 
edge of the charge-coupled device (CCD)—and which hence may have 
reduced wavelength calibration accuracy. Extended Data Table 2 shows 
the line centroids, widths and fluxes. The spectrum at the location of 
FRB 180916.J0158+65 is dominated by emission lines with little con- 
tinuum emission. The O 115,007 A line is detected at the FRB location 
but was corrupted by acosmic-ray hit. As the slit was positioned at the 
edge of the nearby star-forming clump, we observe a non-uniform slit 
illumination pattern and the line centroids, especially for Ha, are shifted 
towards the redder wavelengths by about 1 A. Nevertheless, we can 
confirm that both the clump and the galaxy are at the same distance. 

The flux obtained for the Ha emission at the position of FRB180916. 
JO158+65 (which is dominated by the star-forming clump) implies a 


Article 


luminosity of L,,, = (2.0 + 0.1) x 10” ergs™, which corresponds to astar- 
formation rate® of about 0.016M, yr’. These values were extracted 
froma region of 1.5 arcsec x 2 arcsec around the FRB, corresponding 
to 1.5 kpc’, implying a star-formation surface density of about 107M, 
yr ‘kpc. Note that these values can be strongly affected by extinction, 
whichis estimated to be E(B - V) ~ 1.01. 

From the Ha, N Iland S 1 line fluxes, we estimate the metallicity of 
the galaxy“ to be 12 + log(O/H) = 8.82, close to solar neighbourhood 
metallicity, and five times higher when compared to the host of FRB 
121102 (<8.1 on the same metallicity scale”). 


Chance alignment probability with host galaxy 

The projected position of FRB 180916.J0158+65 is coincident witha 
star-forming clump in the spiral galaxy SDSS JO15800.28+654253.0. 
To confirm an association with the galaxy we have estimated the prob- 
ability of chance coincidence between the two sources. 

We have considered a uniform density of galaxies across the sky. 
Assuming a Poisson distribution of galaxies in the relevant co-moving 
volume, we estimate the number density of typical galaxies brighter 
than M, = -16 (that is, galaxies with masses greater than about 40% of 
the dwarf galaxy host” of FRB 121102) based on theall-galaxy luminosity 
function, and their typical size distribution based on the galaxies 
reported in SDSS DR7™. Then, we calculate the chance coincidence 
probability for asource to be co-located within twice the median half- 
light radius of any galaxy as a function of redshift. 

At the measured redshift of z = 0.0337, the chance coincidence 
probability P« 0.1%. However, we may consider the probability up to 
a redshift of approximately 0.11, the maximum redshift considered 
for FRB 180916.J0158+65 based on the observed DM*®. Evenin this case 
P<1% (see Extended Data Fig. 8). The association between FRB 180916. 
JO158+65 and the galaxy is thus statistically robust. 


DM budget 

FRB 180916.J0158+65 is close to the Galactic plane, with longitude 
[=129.7° and latitude b = 3.7°. The total measured DM of FRB 180916. 
JO158+65 is DM,.,=348.76 pccm, andis expected to be the sum of con- 
tributions from the Milky Way disk (DM) and halo (DM,,),), the interga- 
lactic medium (DM,,,), and host galaxy (DM,,,;-3 including both the ISM 
and halo of the host). For the line-of-sight to FRB 180916.J0158+65, DMyw 
is predicted to be 199 pccm ?and 325 pccm “by the NE2001and YMWI16 
Galactic electron density models, respectively°°. DM,,,1, iS estimated to 
be 50-80 pc cm” or approximately 45 pc cm, according to two avail- 
able models”. Given the range of model predictions, one can ascribe 
up to roughly 100 pc cm® to DM,,,, and DM,,,,.,- For redshift z= 0.0337, 
DM, mis expected to be” approximately 34 pc cm *—although this esti- 
mate is highly dependent online of sight for low redshift®. Overall, after 
subtracting these various contributions, DM,,., is probably less than 
70 pccm®?, and could be muchsmaller. The YMW16 model over-predicts 
DMywinthis direction (unless the DMy,a1,, DMigyand DM,,s contributions 
are all muchsmaller than expected), and this illustrates that FRB DMs will 
help constrain Galactic electron density models. Conversely, because the 
line of sight to FRB 180916.J0158+65 passes through the Galactic plane, 
witha large but uncertain Galactic DM contribution, itis unlikely to bea 
good probe for refining the DM-z relationship in the IGM”. 


Comparison with the other localized repeating FRB 
The physical origin of the FRB phenomenon remains unclear, and many 
of the proposed models? remain consistent with the observational facts 
we present here for FRB 180916.J0158+65. We now briefly consider the 
nature of FRB 180916.J0158+65 by comparison to two of the families 
of models proposed for FRB 121102, whichis the FRB source studied in 
most detail to date, and the only other well localized repeater. 

One of the models to explain the associated persistent radio counter- 
part to FRB 121102 proposes the existence of a 20-50-year-old super- 
nova remnant and nebula powered by a young flaring magnetar”?”, 


The absence of a similarly bright radio nebula associated with FRB 
180916.J0158+65 could be explained, in this model, as an older system 
whose emission has already faded. Assuming a similar model”’, the 
upper limits on persistent radio emission associated with FRB 180916. 
JO158+ 65 are consistent with a system that is at least about five times 
older than FRB 121102, and thus at least 200-500 yr old. Additionally, 
the same modelalso predicts a characteristic decay of the RM withtime. 
An RMas large as the one observed towards FRB 121102” would drop 
to the RM observed towards FRB 180916.J0158+65 at a source age of 
about 300 yr. However, while the results we present here are conceiv- 
ably consistent with the young magnetar scenario for repeating FRBs, 
we find no new support for this scenario. 

Other models, inspired originally by FRB 121102, invoke interaction 
between a nearby strong plasma stream and a neutron star’s magneto- 
sphere”. In the case of FRB 121102, the large” RM and the observational 
similarity” of its compact, persistent radio source to low-luminosity 
active galactic nuclei suggest that the bursts could originate froma 
massive black hole, or a neutron star in the near vicinity. An accreting 
massive black hole can provide a plasma stream that could interact 
with aneutron star magnetosphere”. The absence of a persistent radio 
counterpart to FRB 180916.J0158+65, and its location inthe spiral arm 
ofamorphologically well defined galaxy, make sucha model arguably 
less likely in the case of FRB 180916.J0158+65. For a black hole witha 
similar mass to that of the one considered for FRB 121102 (10°M, to 
10°M.), our upper limits would imply a very low accretion rate of about 
107 ¢aq Where Lag is the Eddington luminosity. Nonetheless, this accre- 
tion rate is still compatible with the low end observed in low-luminosity 
active galactic nuclei”. The presence of a massive black hole at the 
apex of the star-forming clump is thus not excluded, although rather 
unlikely. Nonetheless, in an interacting model”, the plasma stream 
can also have a different origin, and it is conceivable that FRB 180916. 
JO158+65 is a neutron star interacting with, for example, the jet of a 
stellar-mass black hole. 

The proximity of FRB 180916.J0158+65 will aid in performing deep 
multi-wavelength observations, which may help to discriminate 
between the various scenarios outlined above, while potentially also 
suggesting new ones. 


Data availability 


The datasets generated from the EVN observations and analysed in 
this study are available at the Public EVN Data Archive (http://www. 
jive.eu/select-experiment) under the experiment code EM135C. The 
datasets generated from the Gemini observations are available at the 
Public Gemini Observatory Archive (https://archive.gemini.edu) with 
Program ID GN-2019A-DD-110. 


Code availability 


The codes used to analyse the data are available at the following sites: 
AIPS (http://www.aips.nrao.edu/index.shtml), CASA (https://casa.nrao. 
edu), Difmap (ftp://ftp.astro.caltech.edu/pub/difmap/difmap.html), 
IRAF (http://ast.noao.edu/data/software), PRESTO (https://github.com/ 
scottransom/presto), and PSRCHIVE (http://psrchive.sourceforge.net). 
This research made use of APLpy, an open-source plotting package 
for Python hosted at http://ap|Ipy.github.com, Astropy, acommunity- 
developed core Python package for Astronomy”, and Matplotlib”. 
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Extended Data Fig. 1| Burst detections in Effelsberg PSRIX data. Band- 
averaged profiles and dynamic spectra of the four bursts, as detected in the 
PSRIX data (a, b, cand d). A20-ms time window is shown surrounding the burst 
centre. Each burst was fitted with a Gaussian distribution to determine the 
FWHM duration, whichis represented by the cyan bars. The lighter cyan 
encloses the 2oregion. The solid white lines are frequency channels that have 
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been removed from the data due to either RFI or subband edges, indicated by 
the red and blue markers, respectively. For visual clarity, bursts B1, B2 and 

B3 (a, bandc) are downsampled in both time and frequency by a factor of two. 
The RFI excision was done before downsampling. The time and frequency 
resolution used for plotting is shown in the top right of each panel. 
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Extended Data Fig. 2| VLA field image. Field of the continuum radio emission the grey ellipse in the bottom-left corner. Note that a faint source is detected at 
around FRB 180916.J0158+65 as seen by the VLA at 1.6 GHz with a bandwidth of around 6arcsec north of FRB 180916.J0158+65, but its separation is significant 
0.6 GHz. The position of FRB 180916.J0158+65 is marked by the red cross at the (>3o0 confidence level) and we thus conclude that it is not associated with FRB 
centre of the image. Contours start at the 3or.m.s. noise level of 18 pJy per 180916.JO158+65. 

beam and increase by factors of /2. The synthesized beam is represented by 
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Extended Data Fig. 3| Full field of view of the Geminir’ filter. The position of FRB 180916.JO158+65 is highlighted by the red cross. Note that the spiral galaxy 
associated with FRB 180916.J0158+65 is the only clearly visible galaxy in the field. 
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Extended Data Fig. 4| Zoomed-in images at the position of FRB 180916. orientation and placement of the 1.5-arcsec spectroscopic slit used to obtain 
J0158+65. Gemini data at r’ (a) andg’ bands (b). The position of FRB180916. the optical spectra. Note that the slit does not cover the full star-forming region 
J0158+65 is highlighted by the white cross. The uncertainty in its position is but the region centred on FRB180916.J0158+65, and that the whole regionis 


smaller than the resolution of these images. The dashed lines represent the strongly affected by extinction (E(g-r) =1.73+ 0.09). 
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Extended Data Fig. 5 | Host galaxy source of ionization. Emission line flux 
ratios of [N 1]/Ha (left) and [S 11]/Ha (right) plotted against [O 111]/HB. The 
greyscale distribution represents the SDSS DR12 sample” of 240,000 galaxies 
that display significant emission lines (>50), where the solid and dotted grey 
lines denote the demarcations between star-forming and AGN-dominated 
galaxies”. The host galaxies of FRB 121102 and FRB 180924 (shown by the 
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greenand red circles, respectively, where error bars show the louncertainty) 
are consistent with star-forming and AGN-dominated galaxies, respectively”. 
Though the Gemini-North spectrum of FRB 180916.J0158+65 does not cover 
the [O 111] and HB lines, its [N 11]/Ha and [S 11]/Ha line ratios are broadly 
consistent with a star-formation-dominated galaxy (represented by the 
vertical lines and the loregion as linewidth). 
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Extended Data Fig. 6| Burst brightness and arrival times. Burst S/Nasa crosses represent the four bursts (from left to right: B1, B2, B3, B4). The black 
function of time during our 2019 June 19 observation of FRB 180916.J0158+65. dashed line indicates the detection threshold of our searchin the pulsar- 
The grey bars represent scans of the FRB 180916.J0158+65 field. The red backend data (S/N=7). 
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Extended Data Fig. 7 | Auto-correlation function and scintillation A Lorentzian fit is shownin green in b. The black vertical dashed line represents 
bandwidth of brightest burst, B4. a, The ACF of the spectrum of the bright, the scintillation bandwidth, defined as the half width at half maximum of the 
narrow burst component of burst B4. b, The ACF for lags between -1.016 MHz Lorentzian fit. 


and +1.016 MHz. The zero-lag noise spike has been removed from the ACF. 


Probability of chance coincidence 


---- FRB 180916.J0158+65 (maximum redshift) 
---- FRB 180916.J0158+65 (host redshift) 
to ~FRB121102-like galaxy with M, < -16.0 

~ Any galaxy with M; < -16.0 
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Extended Data Fig. 8 | Redshift-cumulated probability of chance alignment threshold. At the redshift of the host galaxy, z= 0.0337 (vertical dashed red 
coincidence. Probability of achance alignment between FRB 180916.JO158+65 line), the chance coincidence probability is P< 0.1%, and at the maximum 

and twice the median half-light radius of any galaxy with magnitude M, <-16 possible redshift of about 0.11 derived from the observed DM the probability is 
(orange region) or witha dwarf galaxy like the host of FRB 121102 (blue region) <1% (vertical dashed black line). 

asa function of redshift. The horizontal grey line represents the 1% probability 
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Extended Data Table 1| FRB 180916.J0158+65 burst properties as detected in both the Effelsberg PSRIX data and the 
Effelsberg auto-correlation data 


PSRIX data Auto-correlations 


Burst Peaktime? (MJD) S/N DM(pccm-) Width(ms) S/N DM(pccm %) Width (ms) 


B1 58653.0961366466 14.65 345 1.64 9.87 348.76 2.40 
B2 58653.1112573504 9.48 349 0.33 9.61 348.76 0.32 
B3 58653.1465969404 11.08 356 0.74 9.78 348.76 1.12 
B4 58653.2785078914 46.30 350 0.49 65.42 348.76 0.06 


S/N and width are determined using single_pulse_search.py, and are the exact values returned by this program for the best-fit boxcar function. The DM quoted for the PSRIX data are the DM 
corresponding to the optimal S/N detection. The DM for the Effelsberg auto-correlation data are 348.76 pc cm *®, which is the S/N optimizing DM found for the brightest burst, B4, using the 
PSRCHIVE tool pdmp. 

*Time of arrival (in Modified Julian Day, MJD) of the centre of the burst envelope at the Solar System barycentre after correcting to infinite frequency (that is, after removing the time delay from 
dispersion) using a DM of 348.76 pc cm®. 


Extended Data Table 2 | Properties of the spectral emission lines 


Line ID Galaxy Core FRB Location 
Centroid Width Flux Centroid Width Flux 
(A) (A) (10-6 ergcm-? s") (A) (A) (10-1§ ergcem-* s~') 

0 4958.93A 5122.7 55 0.38(4) = = = 

©), 5006.86 At 5155.6 5.6 4.76(4) = = = 

Ny 6548.05 A 6767.9 5.5 0.31(4) 4 és = 
Ha6562.8A 6784.8 10.9 6.57(4) 6787.5 6.5 1.002(8) 

Ni 6583.45 A 6806.9 10.3 2.51(4) 6809.0 51 0.233(8) 
$6716.44 A 6943.7 7.4* 0.77(4) 6946.2 6.417 0.243(5) 

S 6730.81 A 6953.8 7.4" 0.58(4) 6961.8 6.411 0.123(5) 


Properties of the most relevant spectral lines observed at the location of the core of the galaxy and at the position of FRB 180916.JO158+65. Note that the fluxes are not corrected for extinction. 
Numbers in parentheses indicate 10 uncertainties in the least significant digits. 

+The O1115,006.86 A line is detected at the FRB location but is corrupted by a cosmic-ray hit. 

* ttA single value of linewidth was fitted for both lines. 
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Nonlocal qubit interactions are a hallmark of advanced quantum information 
technologies'>. The ability to transfer quantum states and generate entanglement 


over distances much larger than qubit length scales greatly increases connectivity and 
is animportant step towards maximal parallelism and the implementation of two- 
qubit gates on arbitrary pairs of qubits®. Qubit-coupling schemes based on cavity 
quantum electrodynamics”’* also offer the possibility of using high-quality-factor 
resonators as quantum memories*”. Extending qubit interactions beyond the nearest 
neighbour is particularly beneficial for spin-based quantum computing architectures, 
which are limited by short-range exchange interactions”. Despite the rapidly 


maturing device technology for silicon spin qubits 


16 experimental progress 


towards achieving long-range spin-spin coupling has so far been restricted to 
interactions between individual spins and microwave photons” ”°. Here we 
demonstrate resonant microwave-mediated coupling between two electron spins 
that are physically separated by more than four millimetres. An enhanced vacuum 
Rabi splitting is observed when both spins are tuned into resonance with the cavity, 
indicating a coherent interaction between the two spins and a cavity photon. Our 
results imply that microwave-frequency photons may be used to generate long-range 
two-qubit gates between spatially separated spins. 


The devices studied in this work consist of two double quantum dots 
(DQDs), denoted L-DQD (left DQD) and R-DQD (right DQD), that are 
fabricated ona Si/SiGe heterostructure and positioned at the left and 
right antinodes, respectively, of ahalf-wavelength Nb superconducting 
cavity (Fig. 1a). Device lis fabricated on a natural-silicon quantum well 
and has acavity centre frequency of f. = 6.745 GHz and a decay rate of 
K/(21t) = 1.98 MHz. Device 2 uses an enriched *°Si quantum well with a 
2°Si residual concentration of 800 ppm. A single electron is isolated in 
each DQD” and interacts with the electric field of the cavity through the 
electric dipole interaction. Our device design uses a split-gate cavity 
coupler (labelled ‘CP’ in Fig. 1b) that is galvanically connected to the 
central pin of the superconducting cavity”. 

We first demonstrate the strong coupling of a spin trapped in each 
DQD to acavity photon. In our device architecture, a large electric 
dipole coupling rate of g./(21t) = 40 MHzis combined with an artificial 
spin-orbit interaction generated by a micromagnet to achieve spin- 
photon coupling’””™*. To probe the spin-photon coupling, the cavity 
transmission A/A, is plotted as a function of the cavity probe frequency 

fand B™* in Fig. 1c. The Zeeman splitting increases with the total mag- 
netic field B°* = B™ + B™, where B™ is the externally applied field and 
B™ is the stray field of the micromagnet. Field tuning allows us to bring 
each spin into resonance with a cavity photon of energy hf, where h 
is Planck’s constant. We observe coherent coupling between the spin 
trapped in the L-DQD and the cavity photon, as evidenced by the vac- 
uum Rabi splitting when B™ = 109.1 mT. Strong spin-photon coupling 
is achieved, with the spin-cavity coupling rate g, ,/(2™) =10.7 + 0.1MHz 
(all error estimates in the manuscript correspond to one standard 


deviation) exceeding the cavity decay rate x/(2Tt) = 1.98 MHz and the 
spin decoherence rate y, ,/(21) = 4.7 MHz (see Supplementary Infor- 
mation). Likewise, we observe strong spin-photon coupling for theR 
(right) spin at B™ = 103.1 mT, with g, ,/(21) =12.0 + 0.2 MHz exceeding 
Vs,p/ (211) = 5.3 MHz and «/(2Tt). Factoring in the susceptibility of the 
micromagnet, y= 0.6, the 6-mT difference in B™ is equivalent to a 268- 
MHz difference in the spin resonance frequencies and precludes the 
observation of resonant coupling between the two spins’. 

Tocompensate for local differences in the magnetic field, the device 
was fabricated with the long axis of the Co micromagnets tilted by 
6=+15° relative to the interdot axis of the DQDs (Fig. 1b). Because of this 
intentional asymmetry, adjusting the angle g of the in-plane magnetic 
field relative to the DQD axis provides an additional degree of freedom 
for simultaneous tuning of both spins into resonance with the cavity. 
Qualitatively, the high-permeability Co micromagnet concentrates 
the magnetic-field lines, leading to a maximum total field when B™* 
is aligned with the long axis of the micromagnet”. We show that this 
technique is suitable for controlling the spin resonance frequencies 
in Fig. 1d, where we set the magnetic-field angle at @ = 2.8°. Here, the 
Rspin resonance condition shifts up to B™ = 104.7 mT and the L spin 
resonance condition shifts down to B™ = 107.4 mT. 

We now demonstrate control over the difference between the spin 
resonance frequencies by fixing the magnitude of the external magnetic 
field B™ and varying the angle g of B™ in the plane of the sample. The 
expected spin resonance frequencies are plotted in Fig. 2a as a func- 
tion of g with B™ = 106.3 mT (upper panel) and B™ = 110 mT (lower 
panel), confirming that these two control parameters will bring the 
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Fig.1| Cavity coupler for spins. a, Optical micrograph of the superconducting 
cavity containing two single-electron DQDs. The electron spinin each DQD is 
coupled to the cavity through a combination of electric-dipole and artificial 
spin-orbit interactions. z indicates the direction of the DQD interdot axis. 

b, False-colour scanning electron microscope image of L-DQD. A double-well 
potential is formed beneath the plunger gates P1, and P2,, and the barrier gate 
B2, is used to adjust interdot tunnel coupling. Spin—-orbit coupling is induced 


two spins into resonance with the cavity and with each other. On the 
basis of microwave spectroscopy measurements of the spins, we expect 
resonance to occur around g = 6° and B™ = 106.3 mT. Figure 2b maps 
out the field angle dependence of the spin resonance frequencies over 
the range g = 3°-8° with B™ = 106.3 mT. The resonance frequency of 
the R spin monotonically moves to lower f values as @ is increased, 
whereas the L spin shows the opposite dependence. With @=5.6° both 
spins are tuned into resonance with the cavity. These results are well 


f (GHz) 


6.76 


7 \ 2 
7 ~ 
Bt =110mT |~ 
é ss <Y 6.72 
Es , 
~ at 
6.70 
6.6 i 


10 3 4 ie) 
9 (°) 

Fig. 2| Tuning two spatially separated spins into resonance. a, Expected spin 

resonance frequencies asa function of g for B™' = 106.3 mT (top panel) and 

B™*=110 mT (bottom panel). B™ allows control over both spin frequencies with 

respect to the cavity, whereas @ allows control over the spin frequencies with 

respect to each other. The frequency of the L (R) spinis plotted in blue (purple). 
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by aCo micromagnet (dashed lines). CP, cavity coupler. S1,,S2,,S,,D, denote 
other gates used to define the LDQD.c, Cavity transmission A/Ay as a function 
of B™ and f. Vacuum Rabi splitting, a hallmark of strong coupling, is observed 
for each spin. Fora field angle of g=0° the Rspin and L spin resonance 
conditions are separated by AB™'=6 mT. d, Increasing the field angle to p=2.8° 
reduces the field separation drastically. 


captured by the theoretical prediction of transmission through the 
cavity shown in Fig. 2c. 

The spectrum of the Jaynes-Cummings model for a single spin and 
a single photon in the cavity is shown in Fig. 3a. With the L spin tuned 
into resonance with the cavity, the spin and cavity photon hybridize, 
leading toa vacuum Rabi splitting of magnitude 2g, , inthe cavity trans- 
mission’. By contrast, when both spins are simultaneously tuned 
into resonance with the cavity, the excited-state spectrum of the 


Bext = 106.3 mT 


5 6 
¢ (’) 


b, A/A, asa function of fand g, demonstrating simultaneous tuning of both 
spins into resonance with the cavity at p=5.6° and B™' = 106.3 mT. Dashed lines 
indicate LspinandR spintransition frequencies. c, Theoretical prediction for 
A/Ao. 
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Fig. 3 | Resonant coupling of the two spins viaa cavity photon. a, Tuning theL 
spininto resonance with the cavity results in vacuum Rabi splitting with 
magnitude 2g,,.A vacuum Rabi splitting of magnitude 28.1p= 2, lez, +82, is 
expected when both spins are tuned into resonance with the cavity. b, A/A)asa 
function of fand B™ with g =5.6°, indicating an enhanced vacuum Rabi 
splitting when the L spin and R spin are tuned into resonance with the cavity. 
c,A/Ay as a function of ffor the R spin in resonance (upper curve), the Lspinin 


Jaynes-Cummings model splits into a sub-radiant state and two 
bright states. For Nidentical spins, the Jaynes-Cummings model pre- 
dicts a collective /N enhancement of the coupling rate’. In our 
device geometry, the sub-radiant state is the spin triplet 
10, To) = (1//2)(|0, *,)+|0,¥,%)) because the DQDs are located at 
opposite ends of the cavity, where the electric fields are 180° out of 
phase. Here, the spin states of the L/R spin are quantized along the axis 
of their local total magnetic field. The bright states are hybridizations 
between the singlet state |0, Sp) = (1/2) (10, *,+)-|0,¥,%)) and the 
state with a single photon|I, V, ), which are separated in energy by 
twice the collectively enhanced vacuum Rabi coupling, 
28.1n=2: [g2, een . We now search for evidence of cavity-mediated 
single-spin coupling. 

Figure 3b shows A/A, as a function of fand B™ with g = 5.6°, where 
both spins are in resonance with the cavity. As B™ is increased, both 
spins are simultaneously tuned into resonance with the cavity at 
B= 106.3 mT and we observe an enhancement in the vacuum Rabi 
splitting relative to the datasets shown in Fig. 1c, d. The vacuum Rabi 
splitting is quantitatively analysed for device 1in Fig. 3c and for device 
2 in Fig. 3d. These devices have slight differences in gate geometry, 
and the micromagnets in device 2 are canted at angles of @=+10°. In 
device 1, we extract g, ,, by measuring A/A, at g = 5.6° by decoupling 
the R/L spin from the cavity. This is achieved by increasing the charge 
level detuning (¢,/€,) inthe respective DQD to localize the electronina 
single quantum dot”. In device 2, we extract g, , , by measuring A/A, at 
an off-resonance angle, at the now separated resonant fields (see Sup- 
plementary Fig. 1). For device 1, we observe a vacuum Rabi splitting of 
2g,,/(21) =21.4 + 0.2 MHz when the Lspinis in resonance with the cavity, 
28.,.p/(21) =24.0 + 0.4 MHz when the R spinisin resonance with the cav- 
ity, and 2g, | p/(21t) = 30.2 + 0.2 MHz when both spins are in resonance 
with the cavity. Device 2 shows similar behaviour and again exhibits an 


f (GHz) 


resonance (middle curve) and both spins in resonance (bottom curve). The 
enhancement of the vacuum Rabi splitting with both spins in resonance with 
the cavity is indicative of coupling between the two spins via the cavity mode. 
d, Cavity-assisted spin coupling is also observed inasecond device witha 
different gate pattern. Dashed lines inc, dare fits toa master-equation 
simulation. Insets inc, d are scanning electron microscope images of the 
devices, with scale bars of 200 nm. 


enhanced vacuum Rabi splitting of 2g, , ,/(21™) =18.4 + 0.4 MHzwhenboth 
spinsaretunedintoresonancewiththe cavity, compared totheindividual 
splittings 2g, ,/(21m) = 13.2 + 0.2 MHz and 2g, ,/(21t) = 12.4 + 0.2 MHz. 
Combined, these two datasets give strong evidence for microwave- 
assisted interactions between two spins across a length scale of 4mm, 
which is many orders of magnitude larger than what can be achieved 
using direct wavefunction overlap. Moreover, these measurements 
show that both the field angle and DQD-level detuning can be used 
to modulate the strength of the interactions between the two spins. 

The data in Fig. 3c, d are fitted using a master-equation description 
of the spin-cavity system (see Supplementary Information). We inde- 
pendently measure the linewidths y, |, using microwave spectroscopy. 
The cavity linewidth x and the complex Fano factor g are obtained by 
fitting the bare cavity response with the spins detuned from resonance. 
The spin-photon coupling rates g, |p for each device are obtained by 
fitting the data obtained with the spins individually tuned into reso- 
nance with the cavity, as shownin Fig. 3c, d. From the Jaynes-Cummings 
model we expect the bright states to split with the enhanced collective 
coupling rate 28. tp = 21 ze. +o i The extracted splittings agree with 
the theoretical predictions of 2g, ,p/(21) = 32.1 MHz for device land 
28;,.p/(21t) = 18.1 MHz for device 2 to within 6%. 

The observation of enhanced vacuum Rabi splitting when the sepa- 
rated spins are simultaneously in resonance with the cavity is evidence 
ofalong-range interaction between the two spins and acavity photon. 
The nonlocal interaction of two spins marks an important milestone 
for all-to-all qubit connectivity and scalability in silicon-based quan- 
tum circuits. With further improvements in cavity quality factors and 
spin-photon coupling rates’, time-domain demonstrations of cavity- 
assisted spin-spin coupling should be within experimental reach. The 
spin-photon coupling rate could be considerably increased by using 
high-impedance superconducting cavities””°, potentially enabling 


Nature | Vol577 | 9 January 2020 | 197 


Article 


spin-spin coupling in the dispersive regime, where the spins are 
detuned from the cavity and the photonic degree of freedom can be 
effectively eliminated”. High-fidelity long-range spin-spin coupling in 
the dispersive regime would allowthe implementation of modular qubit 
architectures in silicon, where nearest-neighbour coupled registers 
of spin qubits”° can be interfaced with other sparsely distributed 
registers via microwave photons. 
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Bulk amorphous materials have been studied extensively and are widely used, yet 
their atomic arrangement remains an open issue. Although they are generally 
believed to be Zachariasen continuous random networks’, recent experimental 
evidence favours the competing crystallite model in the case of amorphous silicon? *. 
In two-dimensional materials, however, the corresponding questions remain 
unanswered. Here we report the synthesis, by laser-assisted chemical vapour 
deposition’, of centimetre-scale, free-standing, continuous and stable monolayer 
amorphous carbon, topologically distinct from disordered graphene. Unlike in bulk 
materials, the structure of monolayer amorphous carbon can be determined by 
atomic-resolution imaging. Extensive characterization by Raman and X-ray 
spectroscopy and transmission electron microscopy reveals the complete absence of 
long-range periodicity and a threefold-coordinated structure with a wide distribution 
of bond lengths, bond angles, and five-, six-, seven- and eight-member rings. The ring 


distribution is not a Zachariasen continuous random network, but resembles the 
competing (nano)crystallite model*. We construct a corresponding model that 
enables density-functional-theory calculations of the properties of monolayer 
amorphous carbon, in accordance with observations. Direct measurements confirm 
that itis insulating, with resistivity values similar to those of boron nitride grown by 
chemical vapour deposition. Free-standing monolayer amorphous carbon is 
surprisingly stable and deforms to a high breaking strength, without crack 
propagation from the point of fracture. The excellent physical properties of this 
stable, free-standing monolayer amorphous carbon could prove useful for 
permeation and diffusion barriers in applications such as magnetic recording devices 
and flexible electronics. 


Amorphous materials are used in a wide range of applications, but their 
atomic-scale structure and its effect on properties are far more complex 
than those of crystalline analogues. Inaclassic 1932 paper, Zachariasen’ 
invoked free-energy arguments to propose that amorphous materi- 
als comprise the same bonding units as their crystalline analogues, 
but these units form continuous random networks (Z-CRNs) instead 
of periodic structures. A competing model, namely, the existence of 
crystallites embedded in an otherwise CRN environment, is even older’ 
(the crystallite model of amorphous solids is distinct from nanocrys- 
talline materials, as the latter comprise nanoscale grains separated 
by grain boundaries). In the past few decades, the Z-CRN model has 
gained wide acceptance, especially for amorphous Si and SiO, (a-Si 
and a-SiO,), which are viewed as prototypes. The primary experimental 


evidence was provided by radial distribution functions, which could 
be reproduced accurately by model Z-CRNs. More recently, however, 
experiments that probe beyond pair correlations have found extensive 
concentrations of nanocrystallites of order 1-2 nm, whereas the cor- 
responding models fit the experimental radial distribution functions 
just as well as Z-CRN models? *. The inability to image the atomic-scale 
structure of bulk amorphous materials directly by microscopy means 
that the debate over CRNs versus crystallites has remained unresolved®. 

For two-dimensional (2D) atomically thin materials, the nature of the 
amorphous state can in principle be resolved by direct atomic-resolu- 
tion imaging. Monolayer amorphous carbon (MAC) can be viewed as 
a prototype amorphous 2D material, an analogue of monolayer crys- 
talline carbon (graphene). Atomic-resolution transmission electron 
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Fig. 1|Morphology of MAC. a, MAC, 5 x 5cm/?, transferred onto a SiO,/Si wafer. 
b, Scanning electron microscope (SEM) image of MAC suspended ona TEM grid 
has uniform contrast throughout. Silicon nanoparticles have been 
intentionally suspended onthe MAC to emphasize the presence of the film. 
Scale bar, 200 nm. c-f, Characterization of MAC grown on different substrates. 
c, Raman spectra for MAC grown on copper and gold substrates were measured 
after transferring to SiO,. The Raman spectrum for MAC onruthenium was 
measured directly on the growth substrate. d, Raman spectra for growth on 
copper, showing the D andG bands with the fitted curve and an/,/I, ratio of 
0.82.e,C 1s XPS spectra measured directly on different growth substrates. 

f, High-resolution C 1s XPS spectra on copper, with the fitted curve showing a 
single carbon sp” peak at 284.0 eV. 


microscopy (TEM) has been used to image freestanding graphene mon- 
olayers irradiated with TEM electron beams to induce disorder™”’. Such 
monolayers, however, are limited in size, generally inhomogeneous 
because of carbon loss and, when crystallites are still present, they are 
not necessarily randomly oriented, as they would be in a truly amor- 
phous material, and can be eliminated by further irradiation. Similarly, 
radiation of aromatic self-assembled monolayers on substrates pro- 
duces nanocrystalline graphene with only small amorphous patches". 

Direct synthesis of MAC samples remains a challenge. Recently, the 
synthesis of amorphous carbon monolayers on Ge substrates using 
conventional chemical vapour deposition (CVD) was reported”. 
These samples were grown at high temperatures (>900 °C), resulting 
in amorphous regions (about 300 nm) that are embedded in a gra- 
phene matrix. The reported TEM images lack atomic resolution to 
establish the structure unambiguously. Instead, the authors concluded 
in favour of a Z-CRN structure on the basis of the halos in diffraction 
patterns derived from fast Fourier transform patterns of TEM images 
of regions 2.5 x 2.5 nm?. However, many of the halos exhibit residual 
six-dot motifs—that is, the existence of crystallites 1-2 nm across can- 
not be ruled out. 
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Here we report the synthesis of MAC by alaser-assisted CVD growth 
process. The self-limiting process leads to a uniform monolayer of 
several square centimetres in less than 1 minute at substrate tempera- 
tures aslowas 200 °C. Extensive atomic-resolution TEM characteriza- 
tion shows unambiguously both the complete absence of long-range 
periodicity anda structure that is consistent with the crystallite model: 
that is, nanometre-sized, randomly oriented and strained crystallites 
comprising only six-member rings embedded ina Z-CRN environment. 
For simplicity, here we discuss representative data on MAC films grown 
onacopper foil at 250 °C unless otherwise stated. Samples are easily 
transferred via wet etching after growth. Unlike other CVD-grown 2D 
materials, MAC is freestanding even on liquid surfaces without the need 
of asupport polymer (Supplementary Video). Figure 1a, b shows that 
the transferred sample (ona SiO,/Si wafer and a TEM grid, respectively) 
is homogeneous and continuous, free from multilayer regions or the 
wrinkles that are typically observed with the transfer of 2D layers. In 
addition, MAC is stable and freestanding even when stored under ambi- 
ent conditions for at least 1 year. 

The MAC samples were first characterized by Raman spectroscopy 
and X-ray photoemission spectroscopy (XPS) measurements. All sam- 
ples have nearly identical spectra independent of substrate (Fig. 1c, d). 
The Raman 2D band at approximately 2,680 cm“, which is pronounced 
incrystalline graphene, is negligible in the MAC, strongly suggesting the 
lack of any long-range order™. Furthermore, the Raman intensity ratio 
|,/I,= 0.82 (Fig. le) indicates an average defect distance” of <1nm. Note 
that Raman mapping of /,//, shows that MAC is uniform over areas of 
50 x 50 pm? (Extended Data Fig. 1). Furthermore, C1s XPS in Fig. If shows 
that the bonds in MAC are mainly carbon sp” (Supplementary Note 1). 

Monochromated, aberration-corrected, high-resolution trans- 
mission electron microscopy (HRTEM) was used to image the exact 
arrangement of carbon atoms in MAC. A large-area HRTEM image 
reveals a connected but distorted structure of five-, six-, seven- and 
eight-member rings (Fig. 2a). The Fourier transform in the inset of 
Fig. 2a shows a broad, continuous halo. Figure 2b shows a zoomed-in 
area of 5 x 5nm7’in which we can clearly see heavily distorted crystallites 
(green regions) about 1nm across, embedded in a CRN background 
that comprises five-, six-, seven- and eight-member rings. As the yellow 
arrows indicate, the crystallites are oriented randomly. This structure 
of MAC is consistent over the entire sample (Supplementary Note 2). 
Thus, MAC is topologically distinct from disordered graphene. 

We also evaluated the pair correlation function of neighbouring 
carbon atoms in MAC samples (Fig. 2e) and a graphene reference (Sup- 
plementary Note 3), acrucial property in deciding whether a material 
is amorphous. Graphene peaks disappear or are highly broadened in 
the MAC pair correlation function after the second nearest neighbours, 
confirming the loss of long-range periodic order. The bond lengths and 
angles for graphene are centred at 1.4 A and 120°, respectively, with 
small variation arising from image aberration and algorithmic error. In 
contrast, MAC has much broader variation, 0.9-1.8 A and 90-150° for 
the in-plane projections of bond lengths and bond angles, respectively 
(Fig. 2f, g). This feature is surprising because the theoretical breaking 
strain at 25-30% for crystalline graphene occurs with deformations® at 
just under 1.6 Aand 135°, so one would expect freestanding MAC to be 
unstable and subject to collapse. On the other hand, sucha wide spread 
inthe bond angle distribution is expected for arandom 2D amorphous 
network, similar to the O-Si-O bonds modelled in a silica bilayer on 
ruthenium”. Moreover, each carbon atom is three-fold coordinated, 
consistent with the sp* bonding concluded from the XPS data. 

Furthermore, the strained crystallites are fundamentally different 
from nanocrystalline graphene domains because they are separated 
not by atomically sharp grain boundaries, but by CRN regions that are 
at least three carbon atoms wide (Fig. 2h, i). Selective-area electron 
diffraction (SAED) patterns show a characteristic diffuse halo (Fig. 2j) 
that confirms the amorphous nature of MAC, in contrast to the sharp 
first- and second-order diffraction rings for nanocrystalline graphene 


4 6 


Fig. 2| Atomic structure of MAC from TEM. a, Monochromated HRTEM image 
of MAC, 10 x 10 nm?, and the corresponding Fourier transform of the selected 
region (inset). b, Large-scale atom-by-atom mapping of the selected region, 

5x 5nm?, ina. The image contrast is inverted and false-coloured for better 
visibility. Colour overlay is added for identification of pentagons (red), 
heptagons/octagons (blue) and strained hexagons (purple for individual 
hexagons; green for crystallite regions) that are omnipresent. Crystallites 
(regions of green hexagons) separate the regions with non-hexagons. 
Crystallites are defined to consist of at least a hexagonal ring surrounded by six 
hexagonal rings. Angular orientation of the hexagons changes across the 
image, as indicated by yellow arrows and the offset angle to the vertical.c, 
Theoretical model created as described in Extended Data Fig. 7, replicating the 
MAC features in band displayed with same colour coding. d, Zoom-in region 


(Fig. 2k). Dark-field TEM also reveals this, with only nanocrystalline 
graphene showing crystallinity (Extended Data Fig. 2 and Supplemen- 
tary Note3). Tosummarize, the direct imaging of the atomic structure 
clearly shows the amorphous nature of MAC to be (nano)crystallite, 
not Z-CRN. 
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highlighted by the red square in b. The bond lengths (in angstr6m) and bond 
angles of each pentagon are precisely measured. e, Pair correlation function 
calculated by mapping the coordinates of each carbon atom inb. Graphene 
imaged under similar conditions with the same mapping algorithm is shown as 
areference. f, Statistical histogram of the distribution of bond length tothe 
first neighbouring atoms for MAC and graphene. g, Statistical histogram of the 
bond angle distribution for MAC and graphene. h, i, Comparison of scanning 
transmission electron microscopy (STEM) imaging for MAC (h) and 
nanocrystalline graphene (i), with false-colour overlay. Original images shown 
in Supplementary Note 3.j, k, SAED patterns over a 3-um-diameter region: for 
the MAC sample shown inh (j) and for the nanocrystalline graphene sample 
shown ini(k). Scale bars, 1nm (a), 0.5nm (b,c), 0.2nm (d),1nm (h, i) and5nm* 
(j,k). 


The interlayer spacing of multilayer MAC was measured by atomic 
force microscopy (AFM) of overlapping MAC layers prepared by mul- 
tiple wet transfers ona SiO,/Si substrate. The spacing was found to 
be approximately 0.6 nm (Fig. 3b, Extended Data Fig. 3). This inter- 
layer spacing is almost twice as large as that of graphene, similar to 
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Fig. 3 |Mechanical properties of MAC. a, Side view of the model MAC shown 
from above in Fig. 2c, showing out-of-plane structural relaxation. b, AFM 
topography of one to three layers of MAC on SiO,/Si. The cyan solid line 
indicates the line-scan position for the overlaid height profile. Image width, 

3 um. c, Histogram of the 2D elastic stiffness (E,,) of 49 suspended MAC 
membranes with applied force of 100 nN, with an average of F,)=115+21INm". 
d, Selected force—displacement curves from multiple indentations with 


phosphorene’s 0.6 nm, and consistent with the out-of-plane buckling 
caused by Stone-Wales defects”. AFM indentation experiments on 
suspended MAC membranes were used to determine the impact of 
structural buckling on its mechanical properties. The distribution 
of 2D elastic stiffness (£,,) obtained by force-deflection measure- 
ments to 100 nN is centred at £,, = 115 Nm” (Fig. 3c). We observed an 
increasing slope (hence stiffness) of the force-displacement curves 


Enp (N mr) 


Membrane deflection (nm) 


increasing force applied in 25-nN steps. Fitting to the linear elastic deformation 
expression is indicated by the dashed red line. Top inset, AFM scan of MAC 
suspended ona2.5-pm-diameter well. The white line is the height profile along 
the centre with a10.0-nm adhesion depth to the well wall. Bottom inset, close- 
up of collapsed MAC (the white line is the height profile along the centre) after 
repeated indentations at 200 nN. Scale bars, 11m (d, topinset) and100 nm 

(d, bottom inset). 


with consecutive indentations of increasing force (Fig. 3d). Hence, MAC 
undergoes plastic deformation with applied force. Furthermore, we 
obtain a breaking strength of 22 N m™, more than half the strength of 
single-crystal graphene”. We note that indentation rupture is restricted 
and does not propagate (Fig. 3c), in contrast to similar experiments 
with crystalline graphene that result in catastrophic failure by rapid 
crack propagation’® (Supplementary Note 4). 
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Fig. 4| Insulating properties of MAC. a, False-colour SEMimage of the two- 
terminal device with a channel width of 500 nmto1pmandelectrodes50 ppm 
long. Scale bar, 20 pm. b, Nonlinear /-V curves measured at five different 
temperatures (Extended Data Fig. 4).c, Resistivity for samples with different 
layer (L) numbers asa function of temperature. The red line is the linear best fit 
with offset wand slope Nequal to 10°°°*°° and -6.5 + 0.05, respectively. 
Leakage current from SiO,/Si prevents lower-temperature measurements. 
Inset, the Arrhenius plot gives an activation energy of 235 meV. d, Top view of 
the model MAC overlaid by the modulus square of the lowest unoccupied 
wavefunction. e, The density of states of the model MAC (solid blue line) shows 
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a peak at the Fermi energy (zero energy onthe plot) compared with that of 
graphene (dashed black line). Inset, phase-coherent electronic transmission 
through the model MAC (solid blue line) compared with that through graphene 
(dashed black line). f, Tauc plot calculated using an absorption coefficient a 
and photonenergy hv (A, Planck constant; v, frequency) to determine the 
optical bandgap in amorphous materials, with extrapolation of the linear 
region (red line) estimating an optical bandgap of 2.1 eV. Inset, optical 
transmittance of MAC on CaF, substrate. The dashed line indicates pristine 
graphene absorption at 2.3%. 


Next, we discuss source-drain bias-dependent electronic transport 
measurements (/-V) as a function of temperature and layer number 
(see also Extended Data Fig. 4). In all samples that we observed, /= V’, 
where a= 2 (Fig. 4b) and the sheet resistance values are of order 100 GO 
at room temperature, similar to CVD boron nitride”. These sheet resist- 
ance values are high for a 2D material and are up to about 10 orders of 
magnitude larger than for graphene. Furthermore, the large activation 
energy of 235 meV (obtained from the Arrhenius plot in Fig. 4c) is 
similar to what has been observed in CVD boron nitride (193 meV)7°. 
On the other hand, the temperature dependence of the resistivity p 
exhibits (Fig. 4c) a power-law dependence, p = a7", instead of the 
expected Mott law (p « e7/To)” wherea, Nand yare fitting factors) for 
variable-range hopping (Extended Data Fig. 5). Inaddition, we observe 
that the temperature dependence of all/—Vcurves collapses toa single 
curve, indicating an apparent power-law dependence (Extended Data 
Fig. 6). The two results together suggest that charge transport in MAC 
is unique in 2D and mimics that in disordered quasi-one-dimensional 
systems, knownas rare-chain hopping”. 

Aperiodically repeatable structural model of MAC was constructed 
to resemble the observed structure in Fig. 2b, with similar distribution 
of ring size and similar randomly oriented crystallites embedded ina 
Z-CRN network (Fig. 2c; see Extended Data Fig. 7 for the construction 
process). Relaxation of the structure using density-functional-theory 
calculations produced buckling in the non-hexagon regions (Fig. 3a), 
as is known to occur at Stone-Wales defects in graphene”. We also 
constructed bilayer MAC by stacking the model structures at different 
relative displacements and minimized the total energy with respect 
to interlayer spacing. We obtained interlayer spacings ranging from 
0.55 nm to 0.62 nm, in agreement with the measured interlayer spac- 
ings discussed above. Quantum molecular dynamics simulations of the 
structure at 300 K reveal that the MAC is stable and undergoes out-of- 
plane distortions in the entire structure in the form of long-wavelength 
flexural phonons, similar to those in graphene”. 

The experimental results described above are in good agreement 
with electronic-structure and quantum transport calculations per- 
formed with the MAC model described earlier (Fig. 2c). In Fig. 4e, 
we show the calculated MAC density of states (DOS) and compare it 
with that of crystalline graphene. It is clear that the V-shaped DOS 
of crystalline graphene around the Dirac point is no longer present. 
Instead, the DOS shows a relatively broad peak at the nominal Fermi 
energy, suggesting an increase of localized states”. In Fig. 4d, we plot 
the wavefunction (shown in blue) of a state within this peak, show- 
ing significant localization on non-hexagon rings that are separated 
by crystallites (Extended Data Fig. 8). The calculated transmission 
coefficient in the region around the Fermi energy, shown in the inset 
in Fig. 4e, approaches zero despite the high DOS, which explains the 
highly resistive charge transport. 

To identify the effective bandgap of MAC, we measured its optical 
properties. The optical transmittance of MAC is 98.1% at 550 nm and 
increases towards 99% in the infrared (Fig. 4f). Clearly, graphene’s 2.3% 
minimum absorption defined by the fine structure constant is no longer 
applicable. Moreimportantly, a Tauc plot gives an optical bandgap of 
2.1 eV with a long tail towards lower energies, which is reminiscent of 
‘tail states’ in 3D amorphous silicon”. We also observed photolumines- 
cence from MAC witha pronounced peak at 2.04 eV, whichis not seen 
in graphene owing to the absence of a bandgap. Charge localization 
could be responsible for both the optical gap and the observation of 
photoluminescence, similar to insulating sp’ amorphous carbon thin 
films with charge localization at isolated sp’ clusters”. Finally, ellip- 
sometry confirmed a bandgap at 2.1 eV and a corresponding relative 
permittivity of about 11 (Extended Data Fig. 9). 

In summary, we have demonstrated a method for the growth of a 
stable amorphous material in the 2D limit, by producing large-area, 
freestanding monolayers of sp*-bonded amorphous carbon films. 
We have furthermore demonstrated that the ring distribution is nota 


Z-CRN but resembles the competing crystallite model. Overall, so far, 
there exists no solid evidence for the formation of homogeneous, sta- 
ble, freestanding Z-CRN monolayers, although their existence cannot 
be ruled out. Finally, stable MAC films directly grown ona wide range 
of surfaces at low temperatures are likely to be useful for a wide range 
of applications, including anti-corrosion barriers for heat-assisted 
magnetic recording on magnetic hard disks, and for current collectors 
and electrodes in batteries and supercapacitors. The possibility of 
engineering a distribution of carbon ring sizes also makes MAC attrac- 
tive for proton transport membranes. 
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Methods 


MAC synthesis 

MAC was synthesized by laser-assisted chemical vapour deposition 
(LCVD) process with krypton fluoride (KrF) pulsed laser (A= 248 nm, 
Coherent Inc.). Copper foils (35 pm thick) were cleaned by sonicationin 
acetone and isopropyl, followed by 30 min annealingin H, at1,010 °Cina 
quartz tube furnace. Sputtered Au and Ru substrates were used without 
prior annealing. The growth substrates were transferred into the LCVD 
vacuum chamber and mounted on the titanium stage. The chamber was 
evacuated to base pressure of 10°° mbar. CH, was introduced into the 
chamber to 10° mbar pressure. The substrate was exposed toa plasma 
(350 kHz pulsed d.c. generator at 20 W) and pulsed laser (40 mJ cm”, 
50 Hz) over a 5-cm area, for complete film formation under 1 min. To 
obtain larger area coverage, the stage was rastered with constant motion 
relative to the laser spot. Substrate temperatures were monitored by an 
infrared thermal sensor (Optis CTlaser 3MH,-CF,). Growth durations 
ranged from 1 min to 10 min, with temperatures due to laser heating 
between 200 °C and 500 °C; MAC growth was of similar quality. Substrate 
temperatures can be further controlled by astage heater, for nanocrys- 
talline graphene grown between 500 °C and 650 °C, and polycrystalline 
graphene grownat >650 °C. Although MAC synthesized on both sides of 
the copper foil has a similar amorphous quality, MAC from the underside 
(facing away from the laser) was used for detailed characterizations. 


MAC sample preparation 

MAC from the top side of the substrate was first removed by 5 min of 
50-W Ar plasma. The Cu foil was wet etched in 0.7 wt% ammonium 
persulfate solution (NH,),S,Og, by floating the growth substrate on 
the solution, with MAC facing up without a polymer support. It was 
transferred to deionized water for 2 htwice, using a rigid substrate, to 
rinse the interface (that is, it was twice transferred and rinsed) before 
scooping onto the desired substrate (SiO,/Si wafers or PELCO holey SiN 
TEM grids with 2.5-~m pores). For transfer of MAC to SiO,,/Si substrate 
with holes for the indentation experiment, spin-coated PMMA (495 
PMMA, A4, 4% in anisole, 4,000 rpm) was required as a polymer sup- 
port during the transfer process. Spin-coated samples were baked on 
ahotplate at 180 °C for 2 min after spin-coating, and the same transfer 
steps were performed. Suspended MAC was dried supercritically from 
acetone in acritical point dryer system. The samples were annealed in 
5% H, in Ar gas for 5h to remove any organic residues. 


Sample characterization 

SEM images were acquired using a FEI Verios 460 field emission scan- 
ning microscope. Raman spectroscopy was measured by ALPHA 300 
R from WITEC (532-nm excitation laser, 1-m-diameter circular laser 
spot). Raman spectra for MAC grown on Cu and Au were measured 
after transfer to SiO, substrate, whereas the Raman spectrum was 
obtained directly on the Ru substrate. XPS spectra were measured 
using UHV Vacuum Generators ESCALAB Mk2 system with Omicron 
7-Channeltron analyser, directly on the growth substrate for Cu, Au 
and Ru. The high-resolution angle-resolved XPS spectrum on Cu was 
measured using the Singapore Synchrotron Light Source at the National 
University of Singapore, with an Omicron EA 125 hemisphere energy 
analyser. XPS C 1s peaks were fitted after a Shirley background subtrac- 
tion using XPSPEAK software with a 30% of Lorentzian—Gaussian ratio. 
The fitted data showa single sp?-hybridized carbon peak with full-width 
half-maximum of 1.0. MAC was transferred toa calcium difluoride sub- 
strate for optical measurements. The ultraviolet—visible absorbance 
was recorded at room temperature on a SHIMADZU UV-3600 spec- 
trophotometer. Photoluminescence measurements were performed 
with an NT-MDT NTEGRA SPECTRA confocal Raman microscope in 
backscattering geometry. We used the excitation laser at 473 nm and 
a100x objective lens with numerical aperture of 0.6 to focus to aspot 
size of approximately 1.5 ppm. The laser power density was kept below 


1kWcm* toavoid heating effects. The photoluminescence signal was 
detected with a charge-coupled device array operating at —80 °C. Opti- 
cal measurements were performed in high vacuum (10* mbar). 


Transmission electron microscopy 

The dark-field TEM (DF-TEM) imaging was conducted in an FEI Tecnai 
F30 microscope operating at 80 kV. DF-TEM images were recorded 
using the diffraction rings selected by the objective aperture, witha 
recording time of 120 s for each image. MAC HRTEM imaging was per- 
formed onanon-commercialized JEOL ARM60 microscope equipped 
with a Schottky field emission gun, a JEOL double Wien filter mono- 
chromator, double delta correctors anda Gatan OneView camera with 
high stability. A slit of 2.8 pm was used for energy filtering, enabling 
a spatial resolution of 1.1A with a beam current of about 20 pA. The 
microscope was operated at 60 kV. The STEM imaging was done ina 
JEOL 2100F with delta probe corrector, which corrects the aberration 
up to fifth order, resulting in a probe size of 1.0 A. The imaging was 
conducted at an acceleration voltage of 60 kV. The convergent angle 
for illumination was about 35 mrad, with a collection detector angle 
ranging from 45 mrad to 200 mrad. AJEOL heating-holder was used in 
all experiments to heat the sample up to 700 °C during imaging. The 
pair correlation function and bond angle distribution were calculated 
from the coordinates of the carbon atoms determined in the HRTEM 
and STEM images by a home-built atom-finding algorithm. 


Computational method 

The theoretical model for the crystallite MAC was constructed using 
the kinetic Monte Carlo method”. The simulation starts with a super- 
cell, 40 x 40 A’, containing 610 carbon atoms randomly placed in the 
x-y plane. The density of carbon atoms is the same as in crystalline 
graphene. The supercell is then relaxed using a conjugated gradient 
algorithm. The interatom interactions are described using the adap- 
tive intermolecular reactive empirical bond-order (AIREBO) potential 
by Stuart’ as implemented in the LAMMPS molecular dynamics pro- 
gram’. Starting from the initial configuration, kinetic Monte Carlo is 
used for annealing. In each Monte Carlo iteration, one Stone-Wales 
transformation is performed to the current configuration: that is, one 
randomly chosen bonded carbon-carbon pair is rotated by 90° inthe 
x-y plane. The probability to accept the new relaxed structure is min 
{1, exp((Eoia— Enew)/Kp7)}, where F,)4 is the energy of the current configu- 
ration, FE, isthe energy of the new configuration after relaxation, and 
k,Tis selected to be 0.5 eV (k,, Boltzmann constant). We perform more 
than 60,000 iterations and generate 1,560 distinct configurations. We 
select the configuration at the 20,000th iteration as the theoretical 
model for its similarity with the experimental HRTEM image (Extended 
Data Fig. 7). The electronic structure is calculated using density func- 
tional theory in the generalized gradient approximation” as imple- 
mented in the VASP software™. The coherent electronic transmission 
probability is calculated using the Transiesta programme. 


Mechanical characterization 

AFM topography was measured by tapping mode using a Bruker AFM. 
This was also used for indentation experiments. MAC was suspended 
on wells with diameter of 1-2.5 pm. Force-displacement curves were 
obtained from indentation experiments performed at the centre of the 
suspended membranes using single-crystal diamond probes with 8-nm 
diamond tips (spring constant of 1-5 Nm) attached to silicon canti- 
levers (K-TEK nanotechnology). The applied force was then increased 
until fracture to obtain the breaking strength of MAC. The 2D elastic 
constant of MAC was calculated by fitting the force-displacement 
results using the equation for force” 


2noé6 Eq?s° 
= - 
In(a/r) a? 


(1) 


where aand rare the radius of the suspended membrane and the AFM 
tip, respectively. o, 6 and E are the 2D pre-tension, deflection and 2D 
effective Young’s modulus of the membrane, respectively, and g (~1.02) 
is a function of Poisson’s ratio (taken as 0.165). 

The breaking strength corresponding to a breaking force F,, was 
then calculated as 


1/2 
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Electronic transport characterization 

Multi-terminal devices of one-, two- and three-layer-thick MAC were 
fabricated ona Si/SiO, wafer using the standard microfabrication pro- 
cedures of electron beam lithography. Oxygen plasma was first used 
to pattern MAC to fixed width of 50 um. Devices were created with 
channel length between 200 nm and 1,000 nm. The width-to-length 
ratio ranged from 50 to 250. The geometry of each sample is shown 
in Supplementary Table. After the second step of lithography, a set 
of Co electrodes (35 nm thick) were deposited in an ultrahigh vacuum 
system by an electron-beam evaporator at the deposition pressure 
of 1x10 torr. Contacts 10 xm wide were made to reduce the contact 
resistance (Fig. 4a). Electrical measurements were carried out in the 
d.c. regime using two Keithley 2400 Sourcemeters for source-drain 
and gate voltages, which allows applying a voltage and measure the 
current simultaneously. The gate leakage was lower than 1nA at room 
temperature and below 25 pA at 150 K. All measurements, for 2D amor- 
phous carbon films, were performed in helium-free cryostats and in 
vacuum better than 10° mbar to ensure reproducible measurement 
conditions. Resistivity measurements are performed at source-drain 
voltage of 1 V. Resistance was extracted from the /-V characteristic 
using a polynomial fit and taken at the first derivative at zero voltage. 
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Extended Data Fig. 1| Optical image and Raman mapping of MAC film 
transferred on Si/Si02 substrate. a, MAC transferred to SiO,/Si wafer. Crease 
and fold at MAC edge from transfer process. b, Contrast-enhanced image in 
greyscale shows no topological features visible on MAC. c, Raman mapping 
shows that the intensity ratio of the D to G band, /p//c, is uniform at the edge of 


SiO, /Si 
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transferred MAC onaSi/SiO, substrate. This ratio is slightly higher than the 
expected ratio of 0.82, as the Raman background signal is not subtracted 
during Raman mapping. d, Raman map (50 x 50 pm’, with 2,500 data points) of 
1p/I, shows uniform quality over a large area (standard deviation is +1.8%). 


Extended Data Fig. 2| Comparison of large-area TEM data. a, Polycrystalline SAED patterns. Both polycrystalline and nanocrystalline graphene show 
graphene with grain size 200-500 nm. b, Nanocrystalline graphene (sample in crystal domains under DF-TEM and have well defined SAED diffraction 
Fig. 2i) with grain size 1-3 nm. c, MAC (sample in Fig. 2h). Left, bright-field TEM; patterns. MAC has no visible domains in DF-TEM. Only MAC has the 
centre, DF-TEM with false-colour image overlay showing crystal domains; right, | characteristicamorphous halo from SAED diffraction. 
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Extended Data Fig. 3 | Thickness of MAC. a, AFM topography of MAC 
transferred onto an atomically flat, exfoliated BN crystal. b, Close-up of the 
region indicated by the white box ina, witha scan length of 3 pm.c, Height 
profile of MAC, obtained by averaging vertically over the entire image scanin 
b, shows a thickness of about 0.6 nm. d, Optical image of MAC grown on gold 
substrates and transferred onto SiO, by wet transfer with KI/I, solution as gold 
etchant and followed by 7 h annealing in Ar/H, at 300 °C. For MAC transferred 
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from gold, residues from the gold etching cannot be fully removed owing to the 
non-optimized transfer process. e, AFM topography of a folded MAC edge ind 
to obtain one and twolayers. f, The height profile along the red dotted lineine 
gives the thickness of MAC grown on goldas about 0.8 nm. Residual 
contamination from the transfer step and difference in substrate interactions 
may account for the higher thickness of MAC when transferred from gold. 
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Extended Data Fig. 4 | Current-voltage measurement and gate dependence. 
a, Nonlinear /-V curves in Fig. 4b shown in linear scale. b, Gate dependence of 
MAC with electrostatic gating by SiO,/Si back gate shows ambipolar behaviour 


and strong temperature dependence, increasing from 100 GO per square at 
room temperature tol TO per square at 200K. 
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Extended Data Fig. 5| Resistivity of samples asa function of the 


temperature. a, Temperature dependence of the resistivity for aset of devices 
with two- and three-layer MAC (denoted BL and TL, respectively). b, The linear 
fit of the resistivity of offset aand power N, as described by the formulap=a7", 
shows alinear correlation, N=2.2(+0.6) —0.33(+0.02)loga. The dataina, bare 
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provided in Supplementary Table.c, d, Resistivity of tri-layer MAC sample 
(sample ID TLO) asa function of temperature plotted ina linear scale. The Mott 
plot (c) shows a temperature limit of 135 K (indicated by the black arrow) up to 
which it can be applied, as opposed to a power-law dependence (d). The red 
lines are the best linear fits. 
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Extended Data Fig. 6 | Universal scaling of nonlinear /-V characteristics for 
MLIsample. All /-V curves are collapsed toa single curve indicating apparent 
power-law dependence ina disordered quasi-one-dimensional system. The 
equation for the universal scaling curve is from ref... 
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Extended Data Fig. 7 | Selection of MAC theoretical model. The configuration 
for the MAC structure at the 20,000th iteration is chosenas the theoretical 
model, shown in Fig. 2c, for its similarity to experimental HRTEM images of 
MAC. Previous steps also show an unstable ring structure (for example, rings 
with fewer than four carbon atoms, or ill-defined rings). Furthermore, the ring 


10,000 steps 
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size distribution from 10,000 stepsis similar . Colour overlay is added for 
identification of pentagons (red), heptagons/octagons (blue) and strained 
hexagons (purple or green) that are omnipresent. Crystallites (green) separate 
regions with non-hexagons. Crystallites are defined to consist of at least one 
hexagonal ring surrounded by six hexagonal rings. 
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Extended Data Fig. 8 |Modulus square of electronic wavefunctions. The lowest unoccupied wavefunctions (blue) are shown for ten states with energies closest 
to the Fermi energy. This shows localization on non-hexagon rings that are separated by crystallites. 
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Extended Data Fig. 9 | Optical measurements of MAC on SiO,/Si. a, 
Photoluminescence spectra were measured with an excitation wavelength of 
473 nm. The peaks marked by asterisks correspond to the Raman signal from 
MAC. The emission spectrum of the film has a broad peak centred at 600 nm. 
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b, Relative permittivity of MAC on SiO,/Si substrate by ellipsometry 
measurements. £, is taken where the imaginary part equals zero, at about 
590 nm (2.1eV), at which the real part of the relative permittivity is 
approximately 11. 
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Graphene films grown by chemical vapour deposition have unusual physical and 
chemical properties that offer promise for applications such as flexible electronics 
and high-frequency transistors’ ’°. However, wrinkles invariably form during growth 
because of the strong coupling to the substrate, and these limit the large-scale 
homogeneity of the film’ *”””. Here we develop a proton-assisted method of chemical 
vapour deposition to grow ultra-flat graphene films that are wrinkle-free. Our method 
of proton penetration? ” and recombination to form hydrogen can also reduce the 
wrinkles formed during traditional chemical vapour deposition of graphene. Some of 
the wrinkles disappear entirely, owing to the decoupling of van der Waals interactions 


and possibly an increase in distance from the growth surface. The electronic band 
structure of the as-grown graphene films shows a V-shaped Dirac cone and a linear 
dispersion relation within the atomic plane or across an atomic step, confirming the 
decoupling from the substrate. The ultra-flat nature of the graphene films ensures 
that their surfaces are easy to clean after a wet transfer process. A robust quantum 
Hall effect appears even at room temperature in a device with a linewidth of 

100 micrometres. Graphene films grown by proton-assisted chemical vapour 
deposition should largely retain their intrinsic performance, and our method should 
be easily generalizable to other nanomaterials for strain and doping engineering. 


Wafer-sized graphene films can be easily produced by chemical vapour 
deposition (CVD)"”. However, the performance of CVD grapheneis still 
inferior to that of small flakes exfoliated from bulk graphite’®. The grain 
boundaries®, atomic vacancies and impurities” inside CVD-grown gra- 
phene are considered to be bottlenecks to their homogeneous quality, 
although some methods have been reported to overcome these chal- 
lenges**. Besides this, electrons in graphene are weakly localized in two 
dimensions witha ballistic transport range over micrometres”’, so not 
only grain boundaries, vacancies and impurities but also wrinkles can 
act as scattering centres. Wrinkles in graphene are considered a kind of 
linear defect" that will degrade the large-scale physical and chemical 
properties. Therefore, both perfect atomic lattices and flatness across 
a large scale will be equally important for super-high performance”. 

During the CVD process, monolayer graphene should adhere to the 
surface ofa growth substrate, such as copper’ or platinum’. The mis- 
match in thermal expansion coefficients between graphene and the 
substrate causes wrinkles (illustrated in Fig. 1a)*”” as an effective way 
to release compressive strain. These are easily formed in rough areas 
suchas atomic steps or surface vacancies”. Some approaches have been 
reported to reduce wrinkles, such as using low thermal expansion coef- 
ficients’, single-crystalline substrates’ or growth at low temperature’, 
but the wrinkles have seemed impossible to eliminate entirely. Trying 
to reduce the interaction between graphene and the growth substrate 
may be the key to this bottleneck’, as illustrated in Fig. 1b. 


CVD processes usually need high temperature (>800 °C) to decom- 
pose a gaseous carbon source’’, and there are plenty of atomic hydro- 
gens, protons and electrons inthe reaction atmosphere. The graphene 
isimpermeable to atomic hydrogen, but the hydrogen cantransform it 
into graphane (C-H bonds) at room temperature (RT). The protons 
are able to penetrate graphene and hexagonal boron nitride (h-BN) by 
an electrochemical process“ “ or as energetic particles”. Once they 
have penetrated into the material, the protons may recombine with 
electrons to form atomic hydrogen again”, bonded to the substrates or 
existing in other forms. The intercalation of hydrogen is assumed to be 
capable of decoupling graphene fromthe substrates after annealingin 
anH,atmosphere””*”’, but the thermal process will not produce enough 
atomic hydrogen and is not effective for delamination”®. Therefore, 
for complete decoupling of graphene films, more protons are needed. 
Inthis study of graphene films on Cu substrates, we controllably gener- 
ate protons by using an inductively coupled plasma (ICP) of hydrogen”, 
instead of the sole thermal process (Fig. 2a). 

We have simulated theoretically the bonding status of recombined 
hydrogen atoms that lie between the graphene and Cu, for different 
temperatures and different densities of atomic hydrogen. First-princi- 
ples molecular dynamics evolution shows that the hydrogen atoms will 
move towards the Cu surface. Some of them bond to Cu atoms, some 
diffuse to interstitial positions between the topmost Cu layers, and 
some re-bond into hydrogen molecules (H,), which can move freely and 
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Fig. 1| Illustration of graphene wrinkle formation and avoidance. a, Schematic illustration of wrinkle formation at an atomic step (or rough area) of ashrinking 
Cu surface when cooling down. b, Schematic illustration of the disappearance of wrinkle when decoupling graphene from its substrate during CVD growth. 


influence the van der Waals interaction between graphene and the Cu 
substrate. At equilibrium, the simulations at RT result in a distance of 
~3.9 A between the Cusurface and the graphene for H:C =1:2 and~3.6A 
for H:C = 1:4. This surface distance changes to ~6.4 A for H:C = 1:2 and 
~3.6 A for H:C=1:4 at 400 °C (Fig. 2b). Therefore, the higher density of 
atomic hydrogen is useful to decouple the interlayer interactions by 
increasing the distance between the substrate surface and the graphene 
layer, and the higher reaction temperature also appears to be helpful 
to the formation of H, (more details are given in Extended Data Fig. 1a, 
band Supplementary Video 1). 

Based on the theoretical results, we monitor the evolution of wrin- 
kles under different ICP treatments. First, graphene films are grown 
on single-crystalline Cu(111) films by a traditional CVD process. Wrin- 
kles invariably exist in these films (Extended Data Fig. 1c-e), despite 
growth at lower temperature (Extended Data Fig. 2a) or with a higher 
H:C ratio (Extended Data Fig. 2b, c). Atomic force microscope (AFM) 
imaging of as-grown graphene films on Cu(111) shows sharp atomic 
steps on the surface (Fig. 2c). The wrinkles (bright lines) occur along 
or perpendicular to the Cu(111) atomic steps, indicating the existence 
of the graphene films. There are also numerous nano-sized bubbles 
(bright dots). The AFM phase image appears homogeneous except 
for the regions of wrinkles, bubbles and atomic steps (inset of Fig. 2c). 

This wrinkled graphene film is then treated by ICP in H, atmosphere 
at 400 °C and 650 °C in turn (Fig. 2d, e). The ex situ AFM images show 
obvious changes in wrinkles and bubbles. After ICP at 650 °C, most 
wrinkles are reduced, and some have been eliminated. Meanwhile, 
more bubbles arise, and they show major differences in the AFM phase 
images. After ultrahigh vacuum (UHV, <10~ Pa) annealing at 400 °C, the 
bubbles are enlarged, but more wrinkles have disappeared completely 
(Fig. 2f). The enlarging and increasing bubbles indicate the existence 
of H, molecules”, formed by rebonding of atomic hydrogen. Height 
profiles across the same wrinkle after different stages of treatment are 
shown in Fig. 2g, marked by pink lines in Fig. 2c—f. This wrinkle in the 
as-grown film is about 4 nm in height, but it reduces to ~3.2 nm after 
ICP at 400 °C, disappears after ICP at 650 °C and never returns even 
after UHV annealing. 

Typical Raman spectra of graphene films before and after ICP treat- 
ment and UHV annealing are plotted in Fig. 2h. There are no obvious 
D bands after the ICP treatments, demonstrating that no graphane 
or atomic defects were formed by this controllable plasma at a tem- 
perature of more than 300 °C (more details are given in Extended Data 
Fig. 3a)”*. After ICP treatment at 400 °C, the Raman frequencies of 
the G (w,) and 2D (@,,) bands show no obvious shift, but their rela- 
tive intensity of /,,//, becomes a little smaller. Both @, and @,) show 
significant blueshift after ICP treatment at 650 °C and UHVannealing, 
which results from the increased compressive strain in graphene?2?99, 


These changes can be reproduced on weakly defective graphene films 
on Cu(111) and micromechanical exfoliated monolayer graphene on 
SiO,/Si, but cannot be reproduced on heavily defective graphene 
because of large atomic vacancies, and cannot be reproduced by helium 
ICP, for which there are no protons in the reaction gas (Extended Data 
Figs. 3b-d and 4). All of this indicates that the van der Waals interactions 
between graphene and the underlying surface have been changed by 
the penetrating protons and recombined hydrogens, but not by the 
direct diffusion of molecular hydrogen. 

Although post-treatment by ICP can effectively reduce the density 
and height of wrinkles once they have formed, there are still some wrin- 
kles, along with newly generated bubbles. The schematic illustration in 
Extended Data Fig. 5 is shown to make clear that these wrinkles cannot 
be removed. Large-area Raman mapping also shows that the distribu- 
tion of @,p is not homogeneous (Extended Data Fig. 6a, b). We there- 
fore developed the CVD growth process further by supplying enough 
protons by creating a plasma in an H, atmosphere, and hence avoiding 
the formation of wrinkles in the first place. The numerous protons that 
have penetrated into the material lead to re-bonded H, lying between 
the as-grown graphene and Cu(111), and this is intended to decouple 
the van der Waals interactions during the CVD growth so that the grown 
graphene canslide freely without being dragged by the atomic steps or 
atomic vacancies. A photo of 4-inch graphene films grown on Cu(111) 
by proton assistance is shown in Fig. 3a: it looks homogeneous on the 
large scale, anda typical AFM image of this surface is shown in the inset. 
The roughness of the ICP-CVD graphene films is compared with that 
of CVD graphene on Cu foil and ona Cu(111) substrate in Fig. 3b (details 
in Extended Data Fig. 6c). This developed growth process retains the 
roughness of the Cu(111) surface, and no sharp atomic steps appear 
after growth, which is different from the traditional CVD process. 
The @, and @p of multi-point Raman spectra collected from 
30 pm x 30 um regions are plotted in Fig. 3c (details in Extended Data 
Fig. 6d). ICP-CVD graphene films appear more compressed than the 
others, and the Raman frequencies appear similar to those of wrinkled 
graphene films after ICP treatment at 650 °C. The higher compressive 
strains of these ICP-CVD graphene films are consistent with the absence 
of wrinkles. Similar to protons, deuterons can also permeate freely into 
graphene®, and this could also be used to flatten wrinkled graphene 
by ICP post-treatment to grow ultra-flat graphene films (Extended 
Data Fig. 6e, f). 

Moiré patterns, formed by the strong van der Waals interactions, can 
easily be observed in the graphene/Cu’ and graphene/SiC” systems. 
However, there are no obvious moiré patterns in the ICP-CVD graphene 
films (inset of Fig. 3d and Extended Data Fig. 7a). Scanning tunnelling 
spectroscopy (STS) spectra were collected at different points marked 
in the scanning tunnelling microscope (STM) images in Fig. 3d. They 
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Fig. 2 | Proton-assisted relaxation of graphene wrinkles on copper. a, 
Illustration of protons penetrating through the as-grown graphene film during 
ICP treatment. The van der Waals interaction between the graphene and the 
substrate is altered by the recombined hydrogen (H atoms and H,).b, 
Theoretical simulation of bonding status for recombined hydrogen at different 
density. A high density will lead to the production of re-bonded H,, the 
presence of which can decouple the interaction between the graphene and 
substrate, at 400 °C. c-f, Typical AFM images of wrinkled graphene film on 
Cu(111) after different treatments: the surface as grown by CVD (c); after ICP at 


all show no energy gaps and obvious V-shape Dirac cones around the 
Fermilevels. Linear dispersion features are homogeneously present at 
the atomic planes and steps on the Cu(111) surface. Moreover, the moiré 
patterns and energy gaps in the wrinkled graphene/Cu(111) system 
could be almost eliminated by the ICP post-treatment (Extended Data 
Fig. 7b, c). Angle-resolved photoemission spectroscopy (ARPES) is also 
performed to characterize these samples. At the K point (Fig. 3e), linear 
conical Dirac dispersion appears in the ICP-CVD films. Their m-bands 
intersect at a Dirac point of approximately 40 meV below the Fermi 
level, indicating slight n-type doping. Graphene suspended across a 
micro-hole shows a Dirac point at 0 V without any doping. The feature 
of the slight shift of the Dirac point inthe sample in Fig. 3e is similar to 
those in suspended samples. However, the quasi-suspended feature 
is ina metastable state, and it can easily be destroyed by annealing 
at higher temperature (Extended Data Fig. 7a, d). The Dirac point of 
this wrinkled film on Cu(111) before ICP treatment declines at about 
310 meV, and a mini bandgap of about 130 meV emerges at the Dirac 
point, formed by the strong coupling interactions between graphene 
and Cu”. After ICP treatment at 650 °C, the Dirac point can rise to 
about 170 meV and the bandgap reduce to about 90 meV, implying that 
the coupling between graphene and the substrates becomes weaker. 
Itis worth noting that the doping levels of the ICP post-treated graphene 
films are also inhomogeneous, and they are easily changed during the 
UHV annealing procedure (Extended Data Fig. 7e, f). 
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400 °C (d); after ICP at 650 °C (e); and after UHV annealing at 400 °C (f). Insets 
are the AFM phase images. The wrinkles and bubbles in the graphene films are 
changed during the ICP treatments. g, Height profiles for the same wrinkle, 
taken along the line marked in pink inc-f. The wrinkle disappears completely 
after ICP at 650 °C. h, Typical Raman spectra of graphene films as-grown and 
after ICP treatment, showing that no graphane or atomic defects arise. All the 
AFM images use the same height bar of -5 nmto5nm, andthe length scale bars 
inc-fareall1pm. 


For better monitoring of the van der Waals interactions between 
graphene and substrates, we performed in situ Raman measurements 
with variable temperature (7) for different graphene films. In principle, 
if the graphene films have the same interlayer coupling to the sub- 
strates, their Raman frequencies (w) will show the same Aw/AT for dif- 
ferent graphene films because of the same compressive strain change 
(Ae/AT). After temperature cycling, the Raman spectrum returns to 
its previous form at the same temperature, demonstrating that the 
temperature cycling process is non-destructive (Extended Data Fig. 8a). 
When the temperature decreases from 300 K to 10 K, a, and @,p both 
showablueshift (Fig. 4a). Their shifts are approximately proportional 
to the temperature variation, and the relative intensity ratio and full- 
width at half-maximum (FWHM) barely change during this temperature 
cycling (Extended Data Fig. 8b). The Aw,/AT and A@,)/AT are estimated 
to be 2.65 x 10? cm™K ‘and 7.31 x 10* cm™K "for the as-grown wrin- 
kled films, and they decrease to approximately 1.82 x 10 cm™K ‘and 
4.05 x10* cm K ‘after ICP treatment at 650 °C. The reduced Aw/AT 
indicates an insensitive Ae/AT and a gradual decoupling during these 
ICP treatments. Moreover, the Aw,/AT and A@,p/AT for ICP-CVD grown 
graphene are approximately 1.70 x 10*cm™K ‘and 4.20 x10?cm'K‘, 
indicating that ICP treatment during or after CVD growth has a similar 
effect on decoupling graphene. 

Pure graphene/SiO,/Siand graphene/Cu(111) without hydrogen treat- 
ment (hydrogen ICP treatment or annealing in hydrogen atmosphere) 
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Fig. 3 | Proton-assisted growth of wrinkle-free and quasi-suspending 
graphene films. a, Photo of 4-inch graphene film grown on Cu(111). Inset, 
typical AFM image. b, Typical smoothness of substrates after graphene growth: 
Cu foil and Cu(111) covered by wrinkled graphene films, and Cu(111) covered by 
ICP-CVD grown graphene films. c, Distribution of w,and @,, obtained from 
multiple points in the ICP-CVD grown films and other wrinkled samples, along 
with a suspending film (black solid) as reference. d, STS spectraonICP-CVD 
grown graphene film across a Cu(111) atomic step. Their dispersions appear 


before and after UHV annealing are also investigated for comparison. 
Inthe transferred graphene/SiO./Si, the surface distance is reduced by 
about 5 Aafter UHV annealing at 430 °C, and this will cause a stronger 
van der Waals interaction (Ag/A7) and larger Aw/AT (Extended Data 
Fig. 8c)””*. In the pure graphene/Cu(111) system, UHV annealing should 
also be an efficient way to change their coupling through adjusting 
their surface distance. The w, and @,, are changed after UHV anneal- 
ing at higher than 200 °C, and all of these changes are irreversible. 
The Aw, and Aw,, extracted for graphene transferred to Cu(111), 
obtained from variable-temperature Raman measurements, are plot- 
ted in Fig. 4b (details in Extended Data Fig. 8d). The Aw,/AT and A@,p/ 
AT before UHV annealing are approximately 1.67 x 10? cm? Kt and 
4.11x10*cm™K7, and these change to approximately 2.22 x10*cm7K* 
and 5.97 x 10* cm™K ‘after UHV at 430 °C. It is notable that these 
Aw,/ATand Aq@,)/AT for as-transferred graphene have the same values 
as those of ICP-CVD graphene and wrinkled graphene after ICP post- 
treatment. Itis well known that the as-transferred graphene films before 
annealing have weak van der Waals coupling to the substrates. Hence, 
the penetrating protons and recombined hydrogens may reduce their 
coupling level by increasing their surface distance, whichis consistent 
with the theoretical simulations above. 

The ultra-flat nature makes it easier to remove the surface residues 
that are inevitably introduced by the transfer procedures”. Figure 4c 
counts the number and size of residual nanoparticles (NPs) on differ- 
ent graphene films. The average numbers are about 40, 100 and 400 
for ultra-flat films, wrinkled films on Cu(111) and Cu foils, respectively, 
in regions 30 pm x 30 pm. Typical AFM images of these transferred 
graphene films are shown in the inset of Fig. 4c and Extended Data 
Fig. 9. There are almost no NPs larger than 10 nm in the ultra-flat gra- 
phene films, whereas other films usually have larger and denser residual 


-0.2 -0.1 0.0 01 0.2 
kK, (A) 


-0.1 0.0 0.1 Low 


k, (A) 


-0.1 0.0 0.1 
kK, (A") 


linear (that is, V-shaped rather than U-shaped) and there is no obvious doping. 
Inset, the related STM image. e, ARPES of as-grown graphene film on Cu(111) 
with protonassistance, and wrinkled samples before and after ICP treatment, 
showing that their doping level can be changed by proton penetration. The 
green lines onred hexagons showthe ARPES cutting orientations and locations 
in the Brillouin zone of graphene. Scale bars: inset toa, 21m; insettod,2nm, 
respectively. 


NPs. The inherent wrinkles will adsorb large-sized NPs, which are more 
difficult to remove completely. 

The ultra-flat and easy-clean surface of the graphene strongly indi- 
cates homogeneous performance. Here, the robustness of the quantum 
Hall effects (QHE) is measured. We fabricated standard Hall bar devices 
with 100 pm linewidth, for which the carrier mobility detected from the 
field effect extracted from longitudinal resistance (R,,,) is approximately 
4,900 cm? V's tat RT (Extended Data Fig. 10a)°. Figure 4d shows R,,. 
and Hall conductivity (0,,) as a function of gate voltage (V,,) under a 
fixed magnetic field of 7.5 T at different temperatures. The first o,, pla- 
teaus, which are a hallmark of the Hall effect, appear at 2e?/h for both 
electrons and holes even at RT”. At 1.5 K, the fourth integer plateau is 
already visible, indicating the extraordinary transport properties of 
this sample. The robust QHE in the ultra-flat films indicates ballistic 
transport properties, and the performances of all of these graphene 
films should be much closer to their extraordinary intrinsic nature than 
seen in wrinkled films (more in Extended Data Fig. 10b-d). 

In summary, a proton-assisted CVD method has been developed 
to grow ultra-flat graphene films by decoupling their van der Waals 
interaction with the Cu substrates. Their ultra-flat nature has been dem- 
onstrated by the strong compressive strain, quasi-suspending status, 
decoupled interaction and wrinkle-free morphology. Furthermore, 
the decoupling process originates from the protons permeating into 
the graphene and the recombined hydrogens that then result, rather 
than the direct intercalation of molecular hydrogen. The ultra-flatness 
could allow graphene films to retain more homogeneous properties 
ona large scale, such as their easy-clean surface and robust QHE, and 
will keep their properties closer to the intrinsic values than is the case 
for wrinkled films. This proton penetration process should be easily 
generalized to other materials to tune their doping or strains. 
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Fig. 4 | Easy-clean nature and robust QHE in ultra-flat graphene films. a, Aw, 
and Aw,, extracted from in situ Raman measurements of wrinkled graphene 
before and after ICP treatments, and the ICP-CVD grown graphene as 
reference. The temperature is varied from10 K to 300K. The results indicate 
that the Ac/AT values become less sensitive to temperature change after 
higher-temperature ICP treatment (150, 400, 650 indicate temperatures inK 
for the ICP process). b, Extracted Aw, and Aw,, for graphene transferred onto 
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Methods 


CVD growth of graphene 

Proton-assisted growth of graphene films is performed in a custom- 
ized ICP-CVD system. The typical growth parameters are as follows: 
sputtered 800-nm Cu-Ni(111) films on c-plane sapphire, growth tem- 
perature of 650 °C, pressure of 6 Pa, CH,/H, ratio of 1:20, plasma power 
of 15 W and growth time of 5 min. Traditional CVD-grown graphene 
films are produced ina tube furnace, with typical growth parameters 
as follows: sputtered 800-nm Cu-Ni(111) films on c-plane sapphire or 
commercial Cu foils, growth temperature of 1,050 °C, atmospheric 
pressure, CH,/H,/Ar ratio of 0.02:10:500 and growth time of 30 min. 
We use Cu-Nialloy (90% Cu + 10% Ni) instead of Cu in this study to avoid 
sublimation at 1,050 °C or UHV annealing at 650 °C, and maintain the 
smooth surface after growth. To simplify writing, we use Cu(111) instead 
of Cu-Ni(111) alloy in the main text. 


Transfer of CVD graphene 

After growth, graphene/Cu(111)/sapphire or graphene/Cu foil is spin- 
coated with polymethyl! methacrylate (PMMA) (molecular weight 996 
kD, 4 wt% in ethyl lactate, 2,000 r.p.m. for 1 min). For the samples used 
for variable-temperature Raman measurement, a thermal release tape is 
used to cover the baked PMMA/graphene/Cu(111)/sapphire. An aqueous 
solution of 1M (NH,),S,Ogis used to etch Cu, and the tape/PMMA/gra- 
pheneis pulled out and cleaned with DI water three times. After drying 
the grapheneat 80 °C for 12h, tape/PMMA/grapheneis pasted on SiO,/Si 
or Cu(111) substrate, then baked at 100 °C to bond graphene/Cu(111) 
tightly. All the pasting processes are operated in a glovebox to avoid 
Cu oxidation. Finally, PMMA is completely removed by acetone. For the 
samples used for transport measurements: first, 1M (NH,),S,0, aque- 
ous solution is used to etch Cu. The PMMA/graphene is then cleaned 
with DI water three times. After that, SiO,/Si substrate is pulled out of 
the PMMA/graphene, which is floating on DI water, and baked at 80 °C 
for 12h. Finally, PMMA is completely removed by acetone. 


Preparation of exfoliated graphene sheets 

First, graphene sheets are exfoliated on an SiO,/Si wafer by a traditional 
standard micromechanical cleavage method. Thenthesamples are UHV 
annealed (<10>° Pa) at 650 °C for 30 minand H, annealed under 6 Paat 
650 °C for 10 min, inturn. For encapsulated graphene, all the edges of 
the exfoliated sheets are encapsulated by 100-nm Au, whichis patterned 
by standard micro/nanofabrication (electron beam lithography and 
thermal evaporation). For defective graphene, defects are introduced 
into the exfoliated samples by Ar ICP (200 W for 100s at RT under 6 Pa). 


Preparation of defective graphene films 

All the initial graphene films are grown by traditional CVD on Cu(111) 
at 1,050 °C. For weakly defective films, they are treated by Ar plasma 
at 50 W for 60 s under 6 Pa at RT; for intermediate defective films, Ar 
plasma at 200 W for 50 s under 6 Pa at RT; for heavily defective films, 
Ar plasma at 200 W for 300 s under 6 Paat RT. 


ICP post-treatment 

For ICP post-treatment with H,, the open, encapsulated and defective 
exfoliated graphene sheets, or the CVD graphene films, are all post- 
treated by H, ICP at 2 W for 60s under 6 Paat 650 °C (‘ICP @ dark’), or 
20 W for 60s (‘ICP @light’, as this produces a glow within the plasma). 
For ICP post-treatment with D,, all the processes follow the same param- 
eters as above, only replacing H, by D,. For ICP post-treatment with He, 
the CVD graphene films are treated by ICP at 20 W for 300 s under 6 Pa 
at different temperatures. 


Characterization 


Optical measurements. RT Raman scattering is performed using 
a WiTec/alpha 300R confocal microscope with 532-nm laser under 


ambient conditions. Variable-temperature Raman scattering is 
performed with 532-nm laser excitation and a grating spectrometer 
from Princeton Instruments, with the sample temperature controlled 
by aclosed-cycle Montana Instruments Cryostation C2. The laser power 
is set below1 mW to avoid heating. The Gand 2D peaks in all the Raman 
spectra are fitted Lorentzians. 


AFM, STM, STS and ARPES measurements 

AFM measurements are performed with a Bruker Dimension Fastscan 
system at tapping mode. The STM of graphene films on Cu(111) are per- 
formed under UHV at 77 K. STS differential conductance (d//dV) point 
spectra are measured in constant-height mode using standard lock-in 
techniques at 77 K (f=773.1Hz, AV,,,;=12 mV). The ARPES measurement 
is performed by a hemispherical analyser (Scienta DA30L) with a He I 
plasma ultraviolet light source (Fermi Instruments). Samples are cooled 
down to 15 K during ARPES measurements. 


Transport measurements 

The electrical transport measurements are performed in a*He cryostat 
with a superconducting magnet (Oxford Teslatron 8 T) with source 
meter (Keithley 6430) and lock-in amplifier (Stanford SR830). All 
transport data are transformed by tensor inversion®. The field effect 
mobility is extracted from py: = C' (do/dV,) (ref. **), where Cis the 
gate capacitance per unit area and calculated as -1.22 x10 *F m7” fora 
285-nm SiO, layer. Carrier mobility is also calculated by the Hall effect 
when applying a perpendicular magnetic field. The Hall effect mobil- 
ity is calculated from py, = (L/W) x (1/Ryx) X (dR,,/dB), where B is the 
applied perpendicular magnetic field, L is the length of the channel, W 
is the channel width, R,, is the Hall resistance and R,,,is the longitudinal 
resistance. The density of carriers is calculated as n = C(V,- Vpirac)/€, 
where Cis the capacitance and eis the electron charge. 


Computational details 

The projector-augmented wave method* as implemented in the Vienna 
ab initio simulation package (VASP)* was used together with the gener- 
alized gradient approximation exchange correlation functionals inthe 
Perdew-Burke-Ernzerhof” parameterization. First-principles molecu- 
lar dynamics simulations of graphene/H/Cu(111) systems with different 
H:C ratio at 298 K, 673 K and 923 K under atmospheric pressure. The 
supercell used for calculations contains single-layer graphene with 24 
carbon atoms and five-layer Cu(111) with 60 copper atoms, together 
with 6 or 12 hydrogen atoms (H:C = 1:4 or 1:2) inside, and experimental 
values of lattice parameters of graphene and copper were used**””. The 
initial surface distance between graphene and Cu(111) topmost layer is 
set to 6A, and there is also a 30-A vacuum layer above graphene. The 
minimum distance of hydrogen atoms from the surface in our initial 
supercell is about 2.38 A, which guarantees that the hydrogen atoms 
are not directly bonded. The hydrogen atoms are initially located at 
the exact centre between graphene and the Cu(111) topmost layer. 
The plane wave cut-off is set to 600 eV, and [-point sampling of the 
Brillouin zone is employed. The first-principles molecular dynamics 
calculations are performed in the NVT ensemble with a Nosé-Hoover 
thermostat. A 1-fs step is sufficiently short for the evaluation of the 
motion equations”. Considering the layered nature of the system, 
the van der Waals correction is used with the density functional theory 
D3 method*". At 650 °C, the supercell with H:C = 1:4 has six hydrogen 
atoms. The equilibrium of this system results in the distance between 
the graphene and Cu(111) surfaces decreasing to 3.5 + 0.5 A, because of 
the interlayer van der Waals interaction. As the graphene drops towards 
the copper, this causes an increase in pressure, making the hydrogen 
atoms move towards the Cu(111) surface and finally bond to copper 
atoms on the surface; some of them even diffuse to the interstitial 
spaces between the topmost copper layers. On increasing the hydrogen 
density to H:C = 1:2, a proportion of the hydrogen atoms re-bond into 
hydrogen molecules (H,) during the simulations, and the movement 


Article 


of H, can eliminate the interlayer interaction. During the simulation, 
higher temperature causes the surface distance to vary strongly, but 
the average surface distance between graphene and copper is still 
larger than the initial one (5-7.5 A). 


Data availability 


The data that support the findings of this study are available from the 
corresponding author on reasonable request. 
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Extended Data Fig. 1| Theoretical simulation and wrinkle recognition. substrate (region 30 pm x 30 pm). Wrinkles are not apparent. d, Enlarged AFM 
a, Top and profile view of the supercell of graphene, hydrogens and Cu(111). image (5 um x 5 um). Wrinkles can be recognized by adjusting the height scale. 
b, Theoretical simulation of bonding status for recombined hydrogen with e, Height profiles extracted fromd. The wrinkles are usually lower thaninm. 
H:C =1:4 and 1:2 at 650 °C. Some of the hydrogen atoms form into H, and The root-mean-square (RMS) roughness is about 0.756 nm for cand 0.538 nm 


increase their surface distance. c, AFMimage of graphene grown on Cu(111) ford. Scale barsince,d are 10 pmand1pum, respectively. 
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Extended Data Fig. 2| Wrinkles occur even when growth is at lower AFM images of graphene grown on Cu(111) at 1,000 °C, adjusting the ratio of 
temperature or higher H:C ratio. a, AFM images of graphene films grown on H,:CH, from 10:1 to 1,000:1. c, Enlarged AFM images of b with zoom-in height 
Cu(111) at different growth temperatures from1,050 °C to 850 °C. b, Typical scale. Scale bars ina—c are 3 pm, 10 pmand 3 pm, respectively. 
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Extended Data Fig. 3 | ICP post-treatment of exfoliated graphene sheets on 
SiO,/Si.a, Raman spectra of monolayer graphene after ICP post-treatment at 
different temperatures 7. b, Intrinsic graphene after post-treatment at 
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c, Encapsulated graphene after different treatments. Scale bar, 10 pm. 
d, Defective graphene sheets after different treatments. 
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Extended Data Fig. 4|ICP post-treatment of different graphene films on 
Cu(111). a, AFMimage of weakly defective graphene films after UHV annealing, 
subsequent H, annealing and H, ICP with light (see Methods) treatment. Inset, 
the related AFM phase images. The bubbles apparently occur only after ICP 
treatment. b, Raman spectra ofacollected at similar position. The Raman 
frequencies show an obvious shift after H, ICP treatment. c, AFMimage of films 
witha medium defect level, after UHV annealing, subsequent H, annealing, 

H, ICP @ dark treatment and H, ICP with light treatment. d, Raman spectra ofc 
collected at the similar position, showing almost no shift. e, AFM image of the 
heavily defective films after UHV annealing, subsequent H, annealing, H,ICP 
with dark treatment and H, ICP with light treatment. f, Raman spectra of 
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ecollected at similar position, showing no obvious shift. g, AFM image of the 
as-grown wrinkled graphene films on Cu(111), after He ICP treatment at 400 °C 
and 650 °C. h, Raman spectra collected from the similar position of g.i, Raman 
shifts collected from five different positions, showing random shift and no 
defects formed (no increase in the intensity at -1,350 cm” of the D band). All the 
above results show that the atomic hydrogen, molecular hydrogen, He and He* 
are not helpful in decoupling the defect-free graphene or weakly defective 
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grown graphene to fully decouple it from the substrates. Scale barsina,c,eand 
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Extended Data Fig. 5 | Proposed mechanism of the irremovable wrinkles by 
ICP post-treatment. Schematic illustrations of evolution in wrinkles by ICP 
treatment. a, Wrinkles formed to release the high f,,,.;,, along with the strong 
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Extended Data Fig. 6 | Inhomogeneous strain in ICP post-treated graphene, 
comparison of graphene on different Cu substrates and D,-assisted 
flattening or growth. a, Typical Raman spectra of wrinkled graphene film after 
ICP treatment at 650 °C. b, Inhomogeneous distribution of wp collected from 
2mm x2mmregion, showing that the strains are inhomogeneous ona large 
scale. c, Typical AFM image of Cu foil fully covered by graphene films. Scale bar, 
1pm.d, Typical Raman spectra of graphene films grown on different Cu 
substrates or suspended ona micro-sized hole. e, Raman spectra of wrinkled 
graphene films on Cu(111) after different treatments and of ultra-flat graphene 
films grown by ICP-CVD. All the processes here use D,. f, Distribution of Raman 
frequency collected from multiple points after different treatments. 
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energy gap of 200-350 meV. c, Large-scale STM images of wrinkled graphene 
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typical STS spectra. Inset iszoomed-in STM image, showing no periodic moiré 
pattern and no energy gap. d, ARPES of ICP-CVD graphene films after UHV 
annealing at 500 °C, indicating the metastable state. e, Typical ARPES images 
extracted from different locations on the ICP post-treated graphene films on 
Cu(111). The doping is not homogeneous. f, The doping level changes gradually 
after UHV annealing, and the Dirac point will change from -120 meV (without 
annealing ine) to -170 meV (annealing at 200 °C) and finally reaches -320 meV 
(annealing at 400 °C). Scale bars ina-care4nm,6nmand4nm, respectively. 
Allthe scale bars inthe insets are 1nm. 
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Extended Data Fig. 8 | Variable-temperature Raman measurements for 
grown and transferred graphene on Cu, and transferred grapheneon 
SiO,/Si.a, Raman spectra with measurement temperature cycled from10 K to 
300 K and from 300 Kto10K. The Raman frequency of the G band (w,) and 2D 
band (@,p) is plotted against the variable temperature, demonstrating that the 
coupling interactions are recovered and this variable-temperature 
measurement is non-destructive. b, Raman frequency shift for graphene films 
grown on Cu(111) by traditional CVD, before and after ICP treatment, and 
related FWHM of Gand 2D bands and /,/I,y for the above spectra, showing that 
FWHMis slightly changed but the /,,)//, seems almost unchanged; Raman 
frequency shifts are also shown for as-grown graphene films on Cu(111) by 
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ICP-CVD and on Cu foils by traditional CVD. c, Ex situ AFM height profiles for 
as-transferred and UHV-annealed graphene films. The thickness is decreased 
by about 5A after this UHV annealing. Insets are related AFM height images. 

In situ variable-temperature Raman measurements are also shown for the 
transferred and UHV-annealed films on SiO,/Si wafers, together with related 
Aw, and A@,,. Scale bars, 21m. d, Raman spectra of the transferred graphene 
on Cu(111) under different UHV annealing conditions, and in situ variable- 
temperature Raman spectra of as-transferred graphene films before (25 °C) 
and after UHV annealing at 180 °C and 430 °C. The variable temperatures are all 
from10Kto300K. 


Extended Data Fig. 9 | Statistics of residual nanoparticles on different 
transferred graphene. a—c, AFMimages of residual nanoparticles on 
transferred graphene films after UHV annealing. The films are grown by 
traditional CVD on Cu(111) (a), traditional CVD on Cu foil (b) and ICP-CVD on 
Cu(111) (c, 150 1m x 90 pm). Transferred ICP-CVD graphene films appear 
wrinkle-free and nanoparticle-free during this large-scale measurement. Scale 
barsina-care1 um, 1pm and 30pm, respectively. 


Article 


Carrier Mobility (x10° cm?V‘'s*) 


wo 


n=2.9 x 10" cm? 


nN 


a ° 


Carrier Mobility (x10* cm?V‘ts*) 
b rx) 


— T=1.5K 
—T=S0K 
—T=100K 
— T=200K 
— T=300K 


— T=200K 
| —T=300K 


— T=15K 
—T=50K 
— T=100K 
— T=200K 
—T=300K 


“4 
10 20 30 40 50 60 70 


V,, (V) 


°o 
-30 -20 -10 0 


Extended Data Fig. 10 | Calculation of carrier mobility in the Hall bar devices 
and the robust QHE inthe ultra-flat graphene films. a, Typical transport 
characteristics of graphene FET with device linewidth of 100 pm measured at 
300 Kand 1.5K (left panel); carrier mobilities extracted by field effect 

Mr ~ 4,900 cm? V's and ~4,700 cm? V's" for electrons and holes at 300 K, and 
~5,600cm?V'stand-5,300cm?V's "for electrons and holes at 1.5K (centre 
panel); and carrier mobilities extracted by Hall effect when applying magnetic 
field from 0 to 7.5 T at 300 K, showing f1,,,), is ~9,800 cm? V's at 300 K with 
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n=2.9x10"cm” (right panel). b, Magnetotransport properties of Hall device 
with linewidth of 20 pm. c, Magnetotransport properties of Hall device with 
linewidth of 100 ym, showing that the first o,, plateaus appear at 2e’/h for both 
electrons and holes at RT under 7.5 T.d, Magnetotransport properties of Hall 
device with linewidth of 500 um, showing that QHE still appears. Insets are 
optical images of the Hall devices. The device with linewidth 500 pm was 
fabricated by simple manual operation. Scale bars in b-d are 20 pm, 100 pm 
and 500 pm, respectively. 
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Strain engineering is a powerful tool with which to enhance semiconductor device 
performance”. Halide perovskites have shown great promise in device applications 
owing to their remarkable electronic and optoelectronic properties® °. Although 
applying strain to halide perovskites has been frequently attempted, including using 
hydrostatic pressurization*® °, electrostriction’, annealing’° ”, van der Waals force”, 
thermal expansion mismatch”, and heat-induced substrate phase transition”, the 
controllable and device-compatible strain engineering of halide perovskites by 
chemical epitaxy remains a challenge, owing to the absence of suitable lattice- 
mismatched epitaxial substrates. Here we report the strained epitaxial growth of 
halide perovskite single-crystal thin films on lattice-mismatched halide perovskite 
substrates. We investigated strain engineering of a-formamidinium lead iodide 
(a-FAPbI,) using both experimental techniques and theoretical calculations. By 
tailoring the substrate composition—and therefore its lattice parameter—a 
compressive strain as high as 2.4 per cent is applied to the epitaxial a-FAPbI, thin film. 
We demonstrate that this strain effectively changes the crystal structure, reduces the 
bandgap and increases the hole mobility of «-FAPbl,. Strained epitaxy is also shown to 


have a substantial stabilization effect on the a-FAPbI, phase owing to the synergistic 
effects of epitaxial stabilization and strain neutralization. As an example, strain 
engineering is applied to enhance the performance of an a-FAPbI,-based 


photodetector. 


a-FAPDbI, is epitaxially grown on a series of mixed methylammonium 
lead chloride/bromide (MAPbCI,Br,_,) single crystalline substrates by 
the inverse temperature growth method”. The resulting MAPbCI,Br;,,. 
substrates, with different compositional ratios and thus lattice param- 
eters, are grown by solutions with different Cl/Br precursor molar 
ratios (Supplementary Fig. 1)”. We note that the strain in the epilayer 
is determined not only by the lattice mismatch, but also by the relaxa- 
tion mechanisms. Lattice distortion relaxes the strain, so the region 
near the heteroepitaxy interface has the highest strain, which gradu- 
ally drops at regions distant from the interface. The total elastic strain 
energy increases as the film grows thicker, until it eventually crosses the 
threshold energy for structural defect generation, and dislocations will 
form to partially relieve the misfit'®. A slow growth rate of the epilayer 
is chosen, as a higher rate will increase the defect concentration in 
the epilayer. The crystalline quality of the substrates is carefully opti- 
mized, as the defects in the substrates can propagate into the epilayer 
(Extended Data Fig. 1). 

Heteroepitaxial growth leads to controllable film thickness, prefer- 
ential growth sites and orientations, compatible fabrication protocols 


with existing infrastructures and scalable large-area device applica- 
tions. Figure 1a shows optical images of a series of MAPDCI,Br3_, sub- 
strates with a layer of epitaxial a-FAPbI, film on the top. The epilayer 
has a uniform thickness with a well defined film—substrate interface 
(Fig. 1b). The film topography can reveal the growth mechanism and 
sometimes the defects caused by strain relaxation. On the one hand, 
asub-100 nm a-FAPDbI, thin film shows a clear interface (Fig. 1b), and 
a well defined terrain morphology, witha step height close to the size 
of a a-FAPbI, unit cell, indicating layer-by-layer growth behaviour of 
the epitaxial a-FAPbI, (Extended Data Fig. 2a, b). A10-m film, on the 
other hand, shows non-conformal growth, indicating strain relaxation 
by dislocation formation (Extended Data Fig. 2c, d). 

The crystallographic relationships between the MAPbCI,Br,_,sub- 
strates and the epitaxial a-FAPbI, thin films are illustrated by high- 
resolution X-ray diffraction (XRD) (Fig. 1c). In their freestanding form, 
both a-FAPbI, and MAPbCI,Br,_, have a cubic structure”. The lattice 
parameters of freestanding a-FAPbI, and MAPbCI,Br,., substrates 
(both with Pm3m space group) are calculated to be 6.35 A (Supple- 
mentary Fig. 1) and 5.83-5.95 A, respectively. The ratio x for each 
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Fig. 1| Epitaxial x-FAPbI, thin films and structural characterizations. 

a, Optical images of the as-grown epitaxial «-FAPbI, thin films. The high 
transparency of the substrates and the smooth surfaces of the thin films 
demonstrate their high quality. Scale bars, 4 mm. b, A cross-sectional scanning 
electron microscope (SEM) image of the epitaxial thin film with controlled 
uniform thickness. Scale bar, 2 um. Inset, magnified SEM image of the 
heterostructure showing a well defined interface. Scale bar, 200 nm. c, High- 
resolution XRD w- 26scan of the (001) peaks of the epitaxial samples on 
different substrates showing the increasing tetragonality with increasing 
lattice mismatch. d, Reciprocal space mapping with (104) asymmetric 
reflection of the a-FAPDbI,, for different lattice mismatches with the substrate. 
The results show a decrease in the in-plane lattice parameter as well as an 


composition is then calculated to be 0-1.50, according to the Vegard’s 
Law (Supplementary Table 1). As xincreases, the MAPbCI,Br3_, (001) 
peaks shift to a higher 26 angle, indicating a decrease in the lattice 
parameters of the substrate and therefore an increase in the lat- 
tice mismatch (Fig. 1c and Supplementary Table 2). Meanwhile, the 
a-FAPbI, (001) peak shifts to a lower 26 angle, indicating an increase 
in the out-of-plane lattice parameter as the in-plane compressive 
strain increases. Whenx exceeds 1.50, the strain energy dramatically 
increases, and the epitaxial growth becomes less thermodynamically 
favourable. a-FAPbI, then randomly crystallizes on the substrate 
(Supplementary Fig. 2). Peak broadening of the epitaxial a-FAPblI, 
is therefore induced by the epitaxial strain and the reduction in film 
thickness, instead of by the strain-induced dislocations or the strain 
relaxation (Supplementary Fig. 3). Figure 1d shows the reciprocal 
space mapping of strain-free and strained a-FAPbI, thin films with 
different lattice mismatch with the substrate. An increase of tetrago- 
nality of the lattice is evident as the compressive strain increases. 
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increase in the out-of-plane lattice parameter with larger compressive strain. 
Q,and Q, are the in-plane and out-of-plane reciprocal space coordinates. 

e, Confocal Raman spectra of the epitaxial layer at different strains. We 
attribute the evolution of the shape and intensity of the peak with strain to the 
increase in lattice tetragonality under higher strain. We note that the broad 
peak at approximately 250 cm ‘is attributed to the Pb-O bond induced by laser 
oxidation. f, Fitting analysis of the Raman peaks. The peak at 136 cm ‘from the 
strain-free sample (black line) is attributed to the Pb-I bond. With increasing 
compressive strain, the peak gradually blueshifts as the bond becomes more 
rigid, and finally splits into a main peak that blueshifts (owing to in-plane bond 
contraction) and a shoulder peak that redshifts (owing to out-of-plane bond 
extension). (a.u., arbitrary units). 


The corresponding strain levels of the a-FAPbI, in those three cases 
are calculated to be 0%, —-1.2% and —2.4%, respectively, on the basis of 
the lattice distortion (where the negative sign denotes compressive 
strain). The Poisson’s ratio is determined to be around 0.3, which is 
consistent with the reported value”. 

We also studied the structure of a-FAPbI, at different strains 
(between 0% and —2.4%, on different substrates) by Raman spectros- 
copy (Fig. le). Control experiments exclude any Raman signals from 
the substrates (Supplementary Fig. 4). The peak at around 136 cmtin 
Fig. le, which originated from the stretching of the lead-iodine bond”, 
increases in intensity and broadens in width as the strain increases. 
The cubic structure of the strain-free a-FAPbI, is less Raman-active, 
and the detectable signal is usually broad and weak. When in-plane 
compressive strain increases, the inorganic framework gradually gains 
tetragonality and produces a stronger Raman signal with a clearly 
distinguishable shape. Interestingly, at around —1.4% strain, the peak 
at 136cm''starts to split into two: a main peak at about 140 cm anda 
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Fig. 2| Optical properties. a, Photoluminescence spectra of a-FAPbI, at 
different strains. The redshift of the photoluminescence peak with increasing 
strain is due to bandgap reduction under compressive strain, consistent with 
the first-principles calculations. b, Focal-point-dependent confocal 
photoluminescence spectra of a 3-um-thick film. When the focal point of the 
laser (indicated by the red point in the schematic; inset) moves towards the 
epitaxial interface, the photoluminescence emission peak shifts from about 
1.523 eV to about 1.479 eV, owing to the large compressive strain close to the 
interface.c, Temperature-dependent photoluminescence spectra of a-2.4% 
strained and astrain-free sample. The bandgap of the strain-free sample shows 


shoulder at about 133 cm‘ (Fig. 1f). When the strain is further increased 
to -2.4%, these two peaks shift to 143 cm™ and 130 cm“4, respectively. 
We attribute the blueshift of the main peak to the compression of 
the in-plane lead-iodine bond, and the redshift of the shoulder peak 
to the stretching of the out-of-plane lead-iodine bond. This result is 
also supported by the simulated Raman spectra by first-principles 
calculations (Supplementary Fig. 4c, d). We also studied the Raman 
spectra of a-FAPbI, of various thicknesses on MAPDbCI, ;.Br,.59 (Sup- 
plementary Fig. 4f). The results are consistent: a strong, sharp peak is 
detected froma sub-100-nm film with -2.4% strain, and a weak, broad 
peak is detected froma film of around 2 um, where the misfit strain is 
relaxed near the film surface. 

Photoluminescence spectra (Fig. 2a) reveal changes in the bandgap 
of sub-100-nm epitaxial a-FAPbI, thin films under different strains 
(between 0% and —2.4%, on different substrates). The photolumi- 
nescence peak of a-FAPbI, gradually shifts from about 1.523 eV at 0% 
strain to about 1.488 eV at -2.4% strain, corresponding toa reduction 
of about 35 meV in the bandgap. We exclude the possible contribu- 
tions to this photoluminescence redshift from thickness-dependent 
bandgap”*™, reabsorption” or halide migration” (detailed discus- 
sions in the Supplementary Information). The bandgap change is 
consistent with the first-principles calculations and absorption 
measurements (Extended Data Fig. 3). The photoluminescence 
peak in Fig. 2a also broadens with increasing strain (Supplemen- 
tary Fig. 5), which is not due to possible charge transfer between 
the epitaxial a-FAPbI, and the substrate (Supplementary Fig. 6). 


Binding energy (eV) 


a stronger temperature dependence than the strained sample, indicating that 
the substrate can reduce the lattice deformation that is caused by the 
temperature change. d, UPS spectra of a-2.4% strained and astrain-free 
sample. The Fermi level and the VBM of the samples can be extracted from the 
intersections of the curves with the horizontal axis, marked by the solid and 
dashed vertical lines, respectively. The results reveal that compressive strain 
increases the VBM more than it does the CBM, owing to the enhanced 
interaction of lead 6s and iodine 5p orbitals under the compressive strain. 
Inset, the schematic band diagram of the -2.4% strained and strain-free 
samples. CB, conduction band; VB, valence band. 


Temperature-dependent photoluminescence studies suggest that 
the emission peak broadening originates from the reduced crystal- 
line quality and the enhanced carrier-phonon coupling under the 
strain (Extended Data Fig. 4). 

Additionally, we studied confocal photoluminescence spectra at 
different locations in an a-FAPDbI, film of around 3 pm ona substrate 
of MAPbCI, .9Br,.5. (Fig. 2b). The photoluminescence peak shifts from 
about 1.479 eV when the laser is focused at the interface where the 
local strain is high, to about 1.523 eV at 3 um from the interface where 
the strain is relaxed. As a control, the photoluminescence redshift 
ina strain-free sample is less obvious (from about 1.516 eV to about 
1.523 eV, Supplementary Fig. 7a), which is attributed to reabsorp- 
tion”. Inthe strained sample, we exclude elastic relaxation although 
halide perovskites are much softer than conventional semiconduc- 
tors’’. Our finite element analysis simulation results show that the 
elastic relaxation for a 3-jum-thick a-FAPbI, thin film is negligible: 
only around 0.09% (Supplementary Fig. 8). Thickness-dependent 
in-plane XRD is used to study the critical thickness at which the strain 
will start to be plastically relaxed (Extended Data Fig. 5). The results 
show that the critical thickness is much less than the thickness we used 
in this study and, therefore, the relaxation can be attributed to plas- 
tic relaxation by the formation of dislocations. Photoluminescence 
measurements from samples of different thicknesses show a similar 
trend (Supplementary Fig. 9), indicating that the strain is relaxed by 
dislocations when the film grows thicker. Temperature-dependent 
photoluminescence studies indicate that the bandgap of a-FAPbI, 
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Fig. 3| Electronic properties. a, Calculated effective masses of the carriers at 
different strains, and electronic bandstructures under three strain levels (3%, 
0% and -3%). The electron effective mass (m;) remains relatively stable with the 
change in strain, while the hole effective mass (m;,) decreases with increasing 
compressive strain. The dashed lines represent the dispersivity of the valence 
band; a less dispersive valence bandstructure indicates a smaller hole effective 
mass. TheZ, Rand A points are high-symmetry points in the first Brillouin zone 
of the tetragonal lattice. Bottom panels with coloured borders represent three 
typical examples with different strains. b, Hole mobilities by Hall effect 
measurements showing that a-FAPbI, with strain of -1.2% has the highest hole 
mobility. Coloured symbols correspond to the strain as inc. The decrease of 
the hole mobility with strain higher than -1.2% is attributed to the increase of 
dislocation density. Number of experiments, n=5 for each strain. Inset, the 


under both 0% and -2.4% strain shows a strong temperature depend- 
ence, owing to the soft nature of a-FAPbI, (Fig. 2c and Extended Data 
Fig. 4)’. The strained-sample bandgap is less temperature-dependent 
compared to that of the strain-free sample, because the smaller ther- 
mal expansion coefficient of the substrate compared to the epitaxial 
layer introduces a constraint”® (detailed discussions in the Supple- 
mentary Information). 

Ultraviolet photoelectron spectroscopy (UPS) reveals the band- 
structure evolution of the a-FAPbI, under strain (see Fig. 2d for 0% 
and -2.4% strain and Extended Data Fig. 6 for other strains). All sam- 
ples exhibit p-type behaviour (see Supplementary Information for 
more details). The Fermi level and the valence-band maximum (VBM) 
of the samples can be extracted from the UPS data. The results show 
that strain of -2.4% lifts the VBM upward by about 50 meV compared 
to the strain-free scenario. Considering the change in the bandgap 
(about 35 meV, Fig. 2a), the -2.4% strain pushes the conduction-band 
minimum (CBM) upward by about 15 meV compared to the strain-free 
scenario. The VBM mainly consists of lead 6s and iodine Sp orbitals, 
and the enhanced coupling between these orbitals under compres- 
sive strain pushes the VBM upward”. The CBM, which consists mostly 
of nonbonding localized states of Pb p orbitals, is less sensitive to 
the deformation of the Pbl, octahedrons’. Therefore, the in-plane 
compressive strain increases the VBM more than it does the CBM. 
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structure of the measurement setup (gold, yellow; parylene-C, grey), not to 
scale. c, Transient photocurrent curves of the epitaxial a-FAPbI, under 
different strains. The transient photocurrent curves are plotted ona log-log 
scale. The carrier transit time—that is, the inflection point of the photocurrent 
curve—is marked by a solid red circle. The inflection point indicates the point at 
which the charge transport carriers switch from the majority to the minority 
carriers. Lines are guides to the eye. d, Plots of calculated carrier mobilities asa 
function of the strain magnitudes. The inset equation, = d’/Vt, transforms the 
carrier transit time to the carrier mobility, where y is the calculated time-of- 
flight carrier mobility, dis the target region thickness, Vis the applied voltage 
and tis the measured carrier transit time. Number of experiments, n=5 for each 
strain. Inset, schematic measurement setup. Coloured symbols correspond to 
thestrainasince. 


The lattice deformation can alter the electronic bandstructure and 
therefore also the carrier dynamics. The effective mass of charge car- 
riers can be assessed by the band curvature extracted from first-prin- 
ciples calculations*’. Figure 3a shows the calculated results of the 
electron effective mass, m;, and hole effective mass, m;,(the top panel) 
and three typical electronic bandstructures (the bottom panels) under 
different strains. On the one hand, the F-k dispersion of the conduction 
band remains relatively unaltered, and m shows only aslight variation 
under strain between 3% and —3%. On the other hand, compressive 
strain can modulate the E-k dispersion of the valence band and con- 
siderably reduce mj, 

To validate these calculations, Hall effect carrier mobilities of the 
a-FAPbI, thin films under strain of between 0% and —2.4% are meas- 
ured (Fig. 3b). Finite element analysis simulation results show that 
potential carrier transfer from the substrate to the epitaxial layer is 
negligible, owing to an insulating layer (Parylene-C) and the energy 
barrier between the epitaxial layer and the substrate (Supplemen- 
tary Fig. 10). All samples measured by the Hall effect show a p-type 
character, whichis consistent with the UPS results. Of all strain levels 
tested, films under —1.2% strain on a MAPbCl) ¢.Br,4. substrate have the 
highest hole mobility (Fig. 3b). Further increasing the strain results in 
a drastic drop in the hole mobility, because of the higher dislocation 
densities that arise at higher strain levels. We note that the devices for 
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Fig. 4 | Epitaxial stabilization. a, Phase stability comparison of thin (sub-100 nm, 
-2.4% strained; pink) and thick (about 10 um, strain-free; black) epitaxial 
a-FAPbI, on MAPDbCI, ;9Br,.59 substrates by XRD. a, a-FAPbI,; 5, 6-FAPbI,;S, 
substrate. The thin, strained sample shows better phase stability (red curves). 
For the thick, strain-free sample, the (001) peak for a-FAPbI, at 13.92° is the 
sameas thestrain-free sample in Supplementary Fig. 1a, which indicates that 
the top surface of the thick sample is fully relaxed (day O, black curve). The X-ray 
can penetrate about 10-20 um into the halide perovskites, which explains why 
the substrate peaks are more intense in the thin sample than in the thick 
sample. The thick, strain-free sample shows signs of a phase transition to 
5-FAPbI, after 24 h (lower black curve). b, Phase stability study by 
photoluminescence spectroscopy. Re-measurement of the thin, strained 
sample after 360 d (lower pink curve) shows no obvious photoluminescence 
peak shift, but does show aslight decrease in peak intensity owing to its natural 


Hall effect measurements have an epitaxial-layer thickness larger than 
the critical thickness to ensure sufficient contact area between the 
halide perovskite and the bottom electrode. Therefore, a high strain 
level will induce a high concentration of dislocations that degrade 
the hole mobility. 

To validate the Hall mobility, we carried out time-of-flight meas- 
urements. The transient photocurrents after single excitation are 
plotted logarithmically in Fig. 3c. The carrier transit time shows the 
smallest value of the film under —1.2% strain. The calculated car- 
rier mobility is plotted as a function of the strain applied (Fig. 3d, 
see the Supplementary Information for calculation details), and 
shows a similar trend to that given by the Hall effect. We note that 
the absolute mobility values from the time-of-flight and Hall effect 
measurements differ, owing to experimental uncertainties in the 
type and quality of electronic contacts made during the fabrication 
processes”. The space-charge-limited-current method can quantify 
trap density”. Results show that a higher strain level leads to a higher 
trap density (Extended Data Fig. 7 and Supplementary Fig. 11), which 
explains the observed decrease in mobility under a higher strain 
magnitude. Capacitance-frequency (C-w) spectroscopy is also used 
to cross-check the trap density (Supplementary Fig. 12), the results 
of which correspond well with those obtained by the space-charge- 
limited-current method. 

Itis widely accepted that a-FAPbI, crystals are metastable at room 
temperature and can quickly phase transform to photo-inactive 
8-FAPDbI, within approximately 24 h (ref. '°), owing to its internal lattice 
strainand low entropy”. Existing strategies for «-FAPbI, stabilization, 
including alloying” and surface passivation™, either widen the band- 
gap or raise the carrier transport barrier by introducing nonconduc- 
tive ligands (detailed discussions in the Supplementary Information). 
However, the epitaxial «-FAPbI, thin film exhibits long-lasting phase 
stability at room temperature. 

Figure 4a shows XRD results of asub-100-nm epitaxial a-FAPbI, thin 
film that is stable for at least 360 d after growth (red curves in Fig. 4a). 
In the 10-pm epitaxial thick film (far beyond the threshold thickness 
at which the strain is fully relaxed), the stabilization effect disappears: 
after 24 h, XRD peaks from 6-FAPbI, can be detected (black curves in 


degradation into Pbl, (ref.’°). For the thick, strain-free sample, the 
photoluminescence spectrum shows an emission peak close to 1.52 eV, similar 
to that inthe strain-free a-FAPbI, bulk crystal shown in Fig. 2a, indicating a 

full strain relaxation in the thick sample. Re-measurement after 24 h (lower 
black curve) shows that the thick film undergoes a transition from the a phase 
tothe 6 phase. Insets, optical images of the two samples, showing clear visual 
clues of the phase stability in the thin, strained sample (black « phase) and the 
phase transition in the thick, strain-free sample (yellow 5 phase) after 24 h. 
Scale bars, 2mm.c, Phase stability study by Raman spectroscopy. The Raman 
characteristics of the thin, strained sample show a peak at 143 cm ‘withno 
substantial difference after 360 d; the thick, strain-free sample (peak at 

136 cm") shows signs of a phase transformation to &-FAPbI, after 24 h, as 
revealed by its signature peak at108cm"™. 


Fig. 4a). The phase stability of the strained a-FAPbI, is also verified by 
photoluminescence (Fig. 4b) and Raman spectroscopy (Fig. 4c). A pos- 
sible stabilization effect from incorporating bromine or chlorine into 
the a-FAPbI, can be excluded, because those foreign ions would stabilize 
the a-phase regardless of the epilayer thickness. X-ray photoelectron 
spectroscopy (XPS) measurements showing the absence of bromine 
and chlorine provide additional evidence that this is not the origin of 
the stability (Extended Data Fig. 8). 

The mechanism of the stable thin a-FAPbI, can be explained by 
two reasons. First, the interfacial energy of cubic a-FAPbI,/cubic 
substrate is much lower than that of hexagonal 6-FAPbI,/cubic sub- 
strate, which is the most critical factor for the stabilization effect 
(Supplementary Fig. 13, Supplementary Table 3, and see Supplemen- 
tary Information for details). The epitaxial lattice is constrained to 
the substrate owing to the strong ionic bonds between them and, 
therefore, the lattice is restricted from the phase transition. Second, 
the driving force of the a-to-6 phase transition is believed to be the 
internal tensile strain in the a-FAPDbI, unit cell, which can induce the 
formation of vacancies and subsequent phase transition”. In this 
study, the epitaxial film is under compressive strain, which neutral- 
izes the effect of the internal tensile strain. Therefore, the synergistic 
effect of the low-energy coherent epitaxial interface and the neutral- 
izing compressive strain are the key to a-FAPbI, stabilization. As a 
control, epitaxial a-FAPbI, thin film is removed from the substrate 
(Supplementary Fig. 14); the removed a-FAPbI, transforms to the 
6 phase within 24 h. 

We demonstrate high-responsivity photodetectors as a use case 
of the strain engineered a-FAPbI, thin film. Figure Sa shows the cur- 
rent-voltage (/-V) characteristics of a strain-free device and a device 
under -1.2% strain. The dark current at -1 V in the strained device 
is around 15% higher than that in the strain-free one, indicating the 
higher defect density of the strained device. However, the photo- 
current in the strained device increases by approximately 180% 
compared to the strain-free device. We attribute the photocurrent 
increase to higher carrier mobility and better alignment of VBM 
to the Fermi level of the gold electrode under compressive strain 
(Supplementary Fig. 15). 
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Fig. 5 | Photodetector characterizations of the a-FAPDI, thin films. 

a, /-Vcharacteristics of Au/a-FAPbI,/indium tin oxide photoconductor 
structured photodetectors. The dark current and photocurrent of the 

-1.2% strained detector are about 15% and 180% higher than those of its strain- 
free counterpart. Detectors are tested with a 685-nm laser under 0.015Wcm7?. 
b, Comparison of responsivity of the -1.2% strained and strain-free 
photodetectors. The responsivity of both devices shows an increasing trend 
with decreasing incident power, as the chances of carrier recombination go 
downat lowillumination intensities®. The strained device yields a higher 


Responsivity of the two photodetectors—defined as the change in 
photocurrent per unit of illumination intensity—is measured at vari- 
ous illumination intensities (Fig. 5b). The responsivity of the strained 
device, which reaches a maximum of 1.3 x 10° A W ‘at an incident 
power density of 1.1x 107 W cm”, is almost twice of that of the strain- 
free device. This is again attributed to the enhanced carrier mobility 
and the better band alignment of the strained device. The responsivity 
of this strained device is, to our knowledge, the highest reported fora 
a-FAPbI, device under similar measurement conditions (for example, 
applied voltage and incident power (Supplementary Table 4)). Similar 
to the trend in Hall effect carrier mobility, the measured responsivity 
peaks at -1.2% strain (Extended Data Fig. 9a). Compressive strain also 
improves the detectivity and the gain of the photodetector (Extended 
Data Fig. 9b, c). Devices with a diode structure can reduce the dark cur- 
rent, but have a much lower responsivity: on average 500 times lower 
than that of the photoconductor-type device (Supplementary Fig. 16). 

The strained device also shows an enhanced external quantum effi- 
ciency over the visible range (Fig. 5c), owing to the enhanced carrier 
mobility as well as more efficient carrier transport across the gold- 
perovskite interface. Additionally, after normalizing the spectra, a 
distinct response in the short-wave infrared region (>810 nm) can be 
identified for the strained device (Extended Data Fig. 9d), consistent 
with the photoluminescence measurements showing bandgap reduc- 
tion under compressive strain. The rise and fall times of the strained 
device are around 30% shorter than those of the strain-free device, 
indicating a faster carrier dynamics (Fig. 5d). 
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responsivity owing to higher carrier mobility and better band alignment. Inset, 
the statistical average of the detector performance. Number of experiments, 
n=5 for eachstrain value. c, External quantum efficiency spectra of the 

-1.2% strained and strain-free photodetectors showing that the strained 
photodetector yields a higher external quantum efficiency as well as a broader 
absorption spectrum (Extended Data Fig. 9d), owing to enhanced carrier 
mobility and bandgap reduction. d, Response times of the photodetectors, with 
faster rise and fall times for the -1.2% strained (9 ts and 34 ps) than the strain- 
free (14 1s and 50 ps) device due tothe enhanced carrier mobility and transport. 
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Methods 


Precursor synthesis 

Methylammonium bromine (MABr) was synthesized as the precursor for 
the substrate growth. First, 20 ml methylamine (40% in methanol, Tokyo 
Chemical Industry) and 21.2 ml hydrobromic acid (48 wt% in water, Sigma 
Aldrich) were mixed in an ice bath and the temperature was maintained 
for the reaction to continue for 2 h. The mixture was heated to 80 °C to 
evaporate the solvent. The precipitate was dissolved in anhydrous ethanol 
(Sigma Aldrich) at 80 °C and cooled down for recrystallization. The crys- 
tals were then centrifuged with diethyl ether and dried at 80 °C overnight. 


Crystal growth 

Methylammonium lead chloride (MAPbCI,) solution was prepared by 
mixing 0.6752 g of methylammonium chloride (MACI, 98%, Tokyo Chemi- 
calIndustry) and 2.781 g lead chloride (PbCI,, 99%, Alfa Aesar) inamixed 
solution of 5 ml anhydrous dimethylformamide (99.8%, Aldrich) and 5 
ml anhydrous dimethyl sulfoxide (DMSO, 99.8%, Aldrich). Methylammo- 
nium lead bromine (MAPbBr;) solution was prepared by mixing 1.120 g 
MABr and 3.670 g lead bromine (PbBr,, 98%, Acros) in 10 ml dimeth- 
ylformamide. The MAPbCI, and MAPbBr;, solutions were mixed with 
different ratios. The mixed solutions were kept at room temperature to 
slowly evaporate the solvent, and single crystals were collected to use as 
substrates. FAPbI, solutions were prepared by mixing formamidinium 
iodide (FAI, 99.9%, Greatcell Solar) and lead iodide (PbI,, 99.99%, Tokyo 
Chemical Industry) at a molar ratio of 1:1in anhydrous gamma-butyrolac- 
tone (Sigma Aldrich) with different concentrations. Strain-free «-FAPbI, 
single crystals were obtained by heating the FAPDbI, solutions to 120 °C. 


Epitaxial growth 

The substrates were heated to different temperatures, and the pre- 
heated FAPbI, solutions (at 100 °C) were then deposited onto the sub- 
strates for epitaxial growth. 


Structural and optical characterizations 

SEMimages were taken with a Zeiss Sigma 500 SEM operated at 3 kV. The 
26/w XRD patterns, the rocking curve (w scan), and the asymmetrical 
reciprocal space mapping around the (104) reflection of the substrate were 
measured by a Rigaku Smartlab diffractometer equipped with a copper 
Ka, radiation source (A = 0.15406 nm) and a germanium (220 x 2) mono- 
chromator. The unit cell parameters (a, c) for (104) reflection reciprocal 
space mapping were converted from (Q,, Q,) by a=1/Q,,c=4/Q,. Raman 
and photoluminescence spectra were measured by a Raman spectrometer 
(Renishaw inVia). Raman peak fitting was done by the Renishaw inVia 
software. Atomic force microscopy was carried out by a scanning probe 
microscope (Veeco) ina tapping mode. XPS and UPS were carried out 
by a Kratos AXIS Supra with an aluminium Ka anode source and a HeI 
(21.22 eV) source, respectively. Measurements were operated under a 
chamber pressure of 10° torr. XPS data were calibrated with the cls peak 
(284.8 eV). If not otherwise specified, bulk a-FAPbI, single crystals were 
usedas the strain-free samples for structural and optical characterizations. 


Device fabrication 

Devices with a vertical structure were fabricated based ona lithog- 
raphy-based method”. Parylene-C (50 nm) and gold (50 nm) were 
sequentially deposited on the substrates, followed by a photolithog- 
raphy process with photoresist AZ-1512. The pattern was composed 
of an array of 2-m-diameter circles (exposed) with 1 um interdis- 
tance (covered by photoresist). The gold was chemically etched with 
wet etchants and the Parylene-C was precisely etched by reactive ion 
etching. The etched substrates underwent secondary growth in their 
corresponding growth solutions so that the substrate surface reached 
the same height as the electrode. Epitaxial growth on the patterned 
substrate enabled the a-FAPbI, crystals to initiate from the exposed 
patterns and gradually merge into a thin film with a controllable 


thickness. We note that the MAPbCI,Br,_, substrates were used for 
the strained devices (heteroepitaxy) and a-FAPbI, substrates were 
used for the strain-free devices (homoepitaxy). The top electrodes 
were then deposited by sputtering (for indium tin oxide, 200 nm). 
For vertical devices, the area of the top electrode was controlled to be 
1x 1mm using a shadow mask. For planar devices, Parylene-C (50 nm) 
and the electrode (gold, 50 nm) were deposited using a shadow mask 
with designed electrode layouts. 


Electrical characterizations 

Space-charge-limited-current measurements were carried out by a 
source meter (Keithley 2400) and a customized probe stationina dark 
environment. Devices with an Au/Perovskite/Au structure were used. 
C-w measurements were carried out by a parameter analyser (B1500, 
Agilent) ina dark environment. Devices with an Au/perovskite/indium 
tin oxide structure were used. The thickness of a-FAPDI, of all devices 
for space-charge-limited current and C-w measurements was con- 
trolled to be 500 nm. Hall effect measurements were carried out with 
a Lake Shore Hall measurement system (HM 3000) using the van der 
Pauw method. We note that the Parylene-C layer prevented direct 
contact between the substrate and electrodes, eliminating possible 
carriers extracted from the substrate. The thickness of the a-FAPbI, for 
all devices for Hall effect measurement was controlled to be 500 nm. 
For the time-of-flight measurement, a 685-nm-pulse laser (10 mW cm”) 
with <10°-s pulse width was used as the light source. Photoresponse 
was measured with an oscilloscope (MSO6104A Channel Mixed Signal, 
Agilent). An external bias of 1 V was applied to drive the carriers in the 
device while a 1-MO resistor was connected in series to simulate the 
open-circuit condition so that the carriers were effectively blocked in 
the devices”. The measurement was carried out in the dark while the 
bias and the laser power were kept constant. The experiment setup 
followed the reported time-of-flight measurement of halide perovskite 
single crystals*?***°, The a-FAPbI, thickness of all devices for time-of- 
flight measurements was also controlled to be 500 nm. 


Photodetector characterizations 

Devices with the structure shown in Supplementary Figs. 15 and 16 were 
used. A 685-nm laser was used as the light source. The /-V characteristics 
were collected ona probe station with an Agilent B2912A source meter. 


First-principles calculations 

First-principles density functional theory calculations were performed 
using the Vienna ab initio Simulation Package (VASP). Electron-ion 
interactions were described using the Projector Augmented Wave pseu- 
dopotential*. The electron-electron exchange-correlation functional 
was treated using the Generalized Gradient Approximation parametrized 
by Perdew, Burke and Ernzerhof®. For bandgap calculations, spin-orbit 
coupling was incorporated owing to the heavy element Pb, and the hybrid 
functionals within Heyd-Scuseria—Ernzerhof formalism with 25% Har- 
tree-Fock exchange were employed. A cutoff energy of 400 eV for the 
plane-wave basis set was used. All structures were fully optimized until 
all components of the residual forces were smaller than 0.01 eV A“. The 
convergence threshold for self-consistent-field iteration was set at 10> 
eV. For optimization of the cubic lattice parameter, aI-centred 3 x 3 x3 
k-point mesh was used. A denser k-point mesh of 4 x 4 x 4 was used to 
obtain accurate energies and electronic structures for strained cells. 
For optimization and static calculations of the heterostructural models, 
T-centred 4 x 4x 1and5 x 5 x1k-point meshes were used, respectively. 
Raman intensities were calculated by the CASTEP module in Materials 
Studios“ with a3 x3 x 3 k-point mesh and a 400 eV cutoff energy. 


Finite element analysis simulations 

Simulation of the current density was done by the multiphysics analysis 
in COMSOL (version 5.4; www.comsol.com). Simulation of the elastic 
strain relaxation was done by the ABAQUS*. 


Data availability 


The data that support the findings of this study are available from the 
corresponding authors on reasonable request. 
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Extended Data Fig. 1| Characterization of substrate quality with different 
growth methods and its impact onthe epitaxial strain. a-f, Rocking curve 
measurements of substrates grown by the inverse temperature crystallization 
(ITC) and slow solvent evaporation (SSE) methods. Lower full-width at half- 
maximum (FWHM) values by the SSE indicate better crystal quality. g, XRD 
patterns of strained a-FAPbI, ona substrate with higher crystal quality (red 
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curve) and relaxed a-FAPbI, ona substrate with lower crystal quality (grey 
curve). Dislocations in the substrates can propagate into and relax the strainin 
the epitaxial a-FAPbI,. The vertical dash line labels the (001) peak position of 
strain-free a-FAPbI,. The peak position from the strain-relaxed FAPbI, (grey 
curve) shifts back to that of strain-free a-FAPbI,. 
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Extended Data Fig. 2| Atomic force microscopy morphology 
characterization of strained and strain-relaxed epitaxial x-FAPbI, films. 

a, b, Atopography image (a) and the corresponding height scanning curve (b) 
of the red line in a of a strained epitaxial a-FAPbI, thin film.c, d, Atopography 
image (c) and the corresponding height scanning curve (d) of the black line inc 
ofastrain-relaxed epitaxial a-FAPbI, thick film. Results show that the strained 
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thin film adopts a layer-by-layer growth model. Nocracks or holes can be 
detected. As the film thickness increases, the total strain energy builds up and 
generates dislocations that propagate throughout the film and relax the strain, 
leading to the formation of cracks and holes. These cracks and holes are 
typically regarded asa signature of strain relaxation. 


Article 


a 
joo1y = 
o 
= 
3 
$ 
[101] 
> 
& 
Qa 
> 
111] 3 
i] 
a 
3210-1 -2 3 
Strain (%) 
c 


Simulation 


c axis parameter (A) 


6.1 6.2 6.3 6.4 6.5 6.6 
a/b axis parameter (A) 


Extended Data Fig. 3| First-principles calculations of the strained a-FAPbI, 
unit cell and experimental absorption spectra of the strained a-FAPbI, 
under different strains. a, Evolution of lattice volume and bandgap asa 
function of strain for three a-FAPbI, lattices with different FA* organic cation 
orientations. For the bandgap calculations, spin-orbit coupling is 
incorporated owing to the heavy element Pb, and the hybrid functionals within 
Heyd-Scuseria-Ernzerhof formalism with 25% Hartree-Fock exchange are 
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employed. b, The absorption spectra of the strained a-FAPbI, thin films. The 
absorption onset redshifts will the increasing strain, which agrees with the 
photoluminescence spectra and prove that the strain can alter the bandgap of 
the a-FAPbI,.c, The caxis length of the unit cell when biaxially straining the a/b 
axes. The slope of the fitted line shows a Poisson’s ratio of about 0.3.d,C-Nand 
C=N bond lengthsat different strain levels. Simulation results show that the 
deformation of the FA‘ skeleton is very small under the applied biaxial strain. 
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Extended Data Fig. 4 | Temperature-dependent photoluminescence 
measurement.a, b, Temperature-dependent photoluminescence of strained 
(a) and strain-free (b) a-FAPbI, before normalization. c,d, Temperature- 
dependent photoluminescence of strained (c) and strain-free (d) a-FAPbI, after 
normalization. Both samples exhibited uniform bandgap narrowing and 
FWHM narrowing with decreasing the temperature. e, f, Temperature- 
dependent photoluminescence (PL) FWHM of strained (e) a-FAPbI, and strain- 
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free (f) a-FAPbI, with fitting. Results show that the strained a-FAPbI, has a 
higher /9, y,. and £,, than that of strain-free a-FAPbI, owing to the strain- 
induced crystalline quality reduction and the strain-enhanced carrier-phonon 
scattering. /y, temperature-independent emission linewidth term associated 
with the structural disorder scattering. y,,, charged-carrier-optical-phonon 
coupling constant. F,o, optical phonon energy. 
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Extended Data Fig. 5 | Plastic strain relaxation study of the epitaxial 
a-FAPDI, thin films. a, b, Thickness-dependent in-plane XRD of -1.2% 
strained (a) and —-2.4% strained (b) a-FAPbI, thin films. Vertical lines label the 
peak position of the fully strained films. Plastic strain relaxation at relatively 
high thickness can be evident by the peak shifting to lower angle and peak 
broadening. c, Thickness-dependent relaxation constant R of the epitaxial 
a-FAPbI, thin films with different strains. Results show that the critical 
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thickness decreases with increasing strain. d, Fitting of the experimental 
critical thicknesses with the People and Bean and the Matthew and Blakeslee 
models (see Supplementary Information refs 69 and 70). Experimental results 
agree well with the People and Bean model, indicating that the plastic strain 
relaxation due to the dislocations generated during the epitaxial growth is the 
dominating relaxation mechanism. 
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Extended Data Fig. 6 | UPS spectra of a-FAPbI, under different strains. a, The 
intersects of the curves with the baseline in the high-binding-energy region 
give the Fermi level position of corresponding strained a-FAPbI, films. There is 
aclear shift of the intersects to higher-binding-energy levels when the 
compressive strain becomes larger. b, The intersects of the curves with the 
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baseline in the low-binding-energy region give the energy difference between 
the Fermi level and the VBM. All a-FAPbI, films have p-type character according 
to the calculated Fermi level position in the bandgap. Meanwhile, the VBMis 
pushed up more than the CBM with increasing strain. Inset, aschematic band 


diagram of the a-FAPbI, under different strains. 
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Extended Data Fig. 7 | Space-charge-limited-current measurement ofthe 
epitaxial x-FAPbI, with different strains. a—d, /-V characteristic curves for 
the space-charge-limited-current measurement of the epitaxial a-FAPbI, film 
with different strains. While the forward scans indicate atypical trap-filling 
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process with increasing the applied voltage, the reverse scan doesn’t showa 
detrapping process. Number of experiments, n=5 for each strain value. Nyrap, 
calculated trap density. 
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Extended Data Fig. 8 | XPS spectra of strained a-FAPbI,. XPS spectra of: a, 13d; b, Pb 4f; c, Br 4p; and d, Cl 2p photoelectrons froma strained a-FAPbI, film. Results 


show that Brand Clare absent in the strained a-FAPbI,. 
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Extended Data Fig. 9 | Photoconductor-type photodetector 
characterizations with a685-nm laser. a, Responsivity as a function of strain 
level in a-FAPbI;. Devices under —0.8%, -1.2% and -1.4% compressive strain give 
better responsivity compared to the strain-free devices. Further increasing the 
compressive strain can lead toa higher density of dislocations, which reduces 
the responsivity. Number of experiments, n=5 for each strain value. 
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Precise protein sequencing and folding are believed to generate the structure and 
chemical diversity of natural channels’”, both of which are essential to synthetically 
achieve proton transport performance comparable to that seen in natural systems. 
Geometrically defined channels have been fabricated using peptides, DNAs, carbon 
nanotubes, sequence-defined polymers and organic frameworks? °. However, none 
of these channels rivals the performance observed in their natural counterparts. Here 
we show that without forming an atomically structured channel, four-monomer- 
based random heteropolymers (RHPs)“ can mimic membrane proteins and exhibit 
selective proton transport across lipid bilayers at a rate similar to those of natural 
proton channels. Statistical control over the monomer distribution in an RHP leads to 
segmental heterogeneity in hydrophobicity, which facilitates the insertion of single 
RHPs into the lipid bilayers. It also results in bilayer-spanning segments containing 
polar monomers that promote the formation of hydrogen-bonded chains” for 
proton transport. Our study demonstrates the importance of the adaptability that is 
enabled by statistical similarity among RHP chains and of the modularity provided by 
the chemical diversity of monomers, to achieve uniform behaviour in heterogeneous 
systems. Our results also validate statistical randomness as an unexplored approach 


to realize protein-like behaviour at the single-polymer-chain level in a predictable 


manner. 


The monomer selection was based on previously designed RHPs"™. 
The two hydrophobic monomers, methyl methacrylate (MMA) and 
2-ethylhexyl methacrylate (EHMA), promote RHP insertion into lipid 
bilayers. Oligo(ethylene glycol) methyl ether methacrylate (OEGMA; 
number average molecular weight, M,, ~ 500 Da) was included to tailor 
the overall hydrophilicity and promote hydrogen-bonded chain (HBC) 
formation>”* within the lipid bilayer for proton transport. 3-sulfopropyl 
methacrylate potassium salt (SPMA) was used to reduce the aggrega- 
tion propensity of the RHPs. 

Although statistically random ona whole-chain level, each RHP chain 
was composed of segments witha range of cumulative hydrophobici- 
ties. The RHP sequences were generated using Compositional Drift, a 
program based onthe Mayo-Lewis model, which considers the relative 
concentrations andthe reactivity ratios of each comonomer pair, and 
on global monomer conversion”. Figure 1a shows 20 representative 
sequences of the RHP (termed ‘RHPI’) with a feeding compositional 
ratio of MMA:OEGMA:EHMA:SPMA = 5:2.5:2:0.5. Although the RHP1 
sequences are statistically random, the heterogeneity is obvious inthe 
cumulative hydrophobicity among different local segments within and 


among RHP!I chains. The local hydrophobicity along an RHP1 chain can 
be evaluated on the basis of the average hydrophilic-lipophilic balance 
(HLB) value of a sliding window (Fig. 1b)'*. An RHP chain can be broken 
down into three types of segments according to the cumulative HLB 
value: those that prefer water, the lipid bilayer and the interface (Fig. Ic). 
Withthe same ensemble composition and statistical monomer distribu- 
tion, individual RHP chains feature different sequences. However, those 
types of segment are seen for all calculated RHP1 sequences. The inser- 
tion of RHP1into lipid bilayers was confirmed by fluorescence imaging 
and differential scanning calorimetry (DSC) measurements. Figure 1d 
shows the confocal images of liposomes incubated with RHP1 end- 
labelled with Texas Red dye. After washing off the excess polymer, the 
fluorescence signal from RHP1 was detected near the lipid membranes 
and overlapped with the signal from a premixed membrane-specific 
dye (NBD-PE). The DSC results show the broadening of the lipid phase 
transition in the RHP1-containing liposomes, confirming RHP1 inser- 
tion, rather than only surface absorption (Extended Data Fig. 1a, b). 
We ran all-atom explicit solvent molecular dynamics simulations 
to visualize the distributions of RHP1 within a lipid bilayer (Fig. le, 
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Fig.1| Segmental heterogeneity in RHP.a, 
Simulated sequences of 20 RHP1 chains (degree of 
polymerization, DP=130). MMA, EHMA, OEGMA 
and SPMAare shown in red, pink, blue and purple, 
respectively. b, Hydrophilic (blue), long (red) and 
short (grey) hydrophobic segments in an RHP1 
chain. Average HLB values for a sliding window of 11 
monomers versus the monomer position are shown 
by the black line. c, Two-dimensional (2D) and three- 
dimensional (3D) illustrations ofan RHP1inalipid 


4 ee 


bilayer. d, Confocal fluorescence images and 3D 


image reconstruction of an NBD-PE-labelled POPC 
60 liposome after incubation with Texas Red-labelled 
RHP1. Scale bar, 20 ppm. e, Snapshot of the spatial 
distribution of an RHP1 chain (the first sequence in 
Extended Data Fig. 2a) in the lipid bilayer in the all- 
atom molecular dynamics simulation. f, Snapshot of 
the HBCs (red sticks) formed within a distance of 


3.5A of the RHP1. 
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Extended Data Fig. 2). Most hydrophilic side chains are found to pro- 
trude into the water or near the membrane surfaces. Hydrophobic 
segments are distributed within the bilayer, anchoring a few OEG side 
chains to the nonpolar region of the bilayer. Those segments are rather 
static, and the OEG side chains in the bilayer move at slower rates than 
the ones in water (Supplementary Videos 1 and 2). The inserted RHP1 
promotes the formation of rather dynamic HBCs among OEGs, meth- 
acrylate carbonyls and waters in the bilayer without adopting uniform 
three-dimensional structures (Fig. 1f, Supplementary Videos 3 and 4) 


120 130 


Texas Red-RHP1 


and should act as ‘stepping stones’ to assist proton transport across 
the bilayers. 

A bioprotonic device (Fig. 2a) was used to probe proton transport 
through a supported lipid bilayer (SLB) composed of POPE and POPG 
(see Methods; POPE:POPG molar ratio, 3:1) ontop ofa Pd/PdH,. contact, 
which translates a proton current into an electronic current”. With 
-200 mV (versus Ag/AgCl) applied to the Pd contact, incorporating 
RHP1(RHPI1:lipid molar ratio, 0.0005) into the SLB increased the current 
of H* across the SLB with respect to the SLB alone. After H* permeated 
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Fig. 2| Transmembrane proton transport. a, Voltage-driven proton transport 
through RHP1 incorporated in an SLB on the Pd/PdH, device. b, pH gradient- 
driven proton flux through RHP1in the liposome. The arrow indicates the time 
when VIn was added. c, Inner pH changes to external alkali metal cations after 
an incubation time of 200s without VIn (black bars). The red bar denotes the pH 
change with VIn and external K*. Error bars are1s.d. (n=3). The RHP1-to-lipid 
molar ratio is 0.0017. 


through the SLB, it was reduced at the Pd surface into H, which phys- 
isorbed onto the Pd surface to form PdH,.. When the voltage on the Pd 
contact was switched from —-200 mV to +40 mV, the positive voltage 
oxidized H into H* and resulted ina positive proton current, i,,,, across 
the SLB. A higher positive i,,, was also detected for the RHP1 contain- 
ing SLB. Using these data, we calculated that adding RHP1 increased 
the apparent membrane permeability of the SLB from 8.1 x 10° s ‘to 
1.6x107°s". The results confirm that RHP1 effectively promotes proton 
transport across the SLB. 

The protontransport was further evaluated using a liposome-based 
fluorescence assay, in which the proton permeation was monitored 
using a pH-sensitive fluorophore pyranine (HPTS). The pH change 
was recorded from liposomes composed of the lipid POPC (see Meth- 
ods; Fig. 2b) or the lipids POPE and POPG (POPE:POPG molar ratio, 
3:1) (Extended Data Fig. 3a, b). Driven by a pH gradient of about 0.8, 
enhancement in the proton flux was observed by adding RHP1 to the 
liposome solutions, which was triggered by the K’ ionophore valinomy- 
cin (VIn) that dissipates membrane potentials. To quantify the proton 
flux rate per RHP1 chain, we premixed the lipid and RHP1 before lipo- 
some preparation and went through freeze-thaw cycles to promote 
RHP1 insertion. We assumed that all RHP1 chains contribute to the 
proton transport and the calculated transport value represents the 
lower limit of the RHP1 performance. The stopped-flow fluorescence 
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Fig. 3 | Interaction of RHP1 and the lipid bilayers. a, SANS profiles of d54- 
DMPC liposomes containing RHP1(-19 kDa) in 86% D,O:H,O. The solvent signal 
was subtracted. Solid lines denote a fit using a three-layer vesicle model for the 
liposome anda sphere model for the RHP1. b, Diffusion coefficients of the 
Texas Red-labelled RHP1s in the liposome membrane asa function of polymer 
molecular weight (7, 19,30 and 40 kDa). Error bars are1s.d. (n=6).c, Inner pH 
changes of the liposomes incubated with RHP1 over 200s after adding VIn. 
Error barsare1s.d.(n=3). 


measurements show an initial proton flux per RHP1 of 8.1+1.0H*s? 
under these conditions, corresponding toa single-RHP1 conductance 
value of (4.4+0.57) x10” S (n=3, s.d.) (Extended Data Fig. 4a-d). This 
is comparable to the value reported for the proton-selective channel 
influenza virus M2, 2.9 x10’ Sto 4.4 x 10” S, at pH 5.7 using a similar 
assay”°. As acontrol, the conductance of the proton/cation channel 
gramicidin A (GramA) was measured to be (4.0 + 0.43) x 10° S (n=3, 
s.d.), inthe same magnitude as the reported values of 1.2 x 10° S to 
2.1x 10° S (refs. 7"””). Slowed proton transport was observed after 
substituting D,O for H,O in this assay (Extended Data Fig. 4e). Theiso- 
tope effect value of RHP1 (1.9) is larger than that of water pore channels 
(1.6), where protons are transported along water wires via the Grot- 
thuss mechanism”. The large isotope effect agrees with the proposed 
transport pathway involving both RHP1 and water, rather than RHP1 
forming a pore in the bilayer®”, 

The insertion of RHP1 did not increase the water permeability 
of the bilayer, confirming the absence of distinct membrane leak- 
age (Extended Data Fig. 4f). The direction of proton transport was 
varied to eliminate the snorkelling effect from the OEG side chains 
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Fig. 4 | Sequence analyses of the RHPs. a, One set of hydrophobic segments 
containing one OEGMA in an RHPI chain (DP = 130). b, Statistical size 
distribution of the hydrophobic segments containing one OEGMA in 4,500 
simulated RHP1 sequences. c, Number of such segments per polymer chain, 
with the minimum segment size ranging from 5to 15. Segments of even 


of bilayer-surface-absorbed RHPI1 chains (Extended Data Fig. 3c). In 
addition, an RHP1 derivative (RHP1,100) containing an OEGMA witha 
longer side chain (M,, ~1,100 Da) was tested. Although it favours bilayer- 
surface binding, RHP1,100 did not promote proton permeation under 
the same conditions (Extended Data Figs. 1c, d, 3b). 

Small-angle neutron scattering (SANS) was used to characterize 
the RHPI size and oligomeric state in a lipid bilayer (Fig. 3a, Extended 
Data Fig. 5). Owing tothe large scattering length density (SLD) contrast 
against RHP1, d54-DMPC (see Methods) was used to differentiate the 
signal of RHP1 from that of the lipid background while still provid- 
ing RHP1 proton transport activity (Extended Data Fig. 3d). The SANS 
measurements provided an average RHP1 (about 19 kDa) radius of 
2.2 nm. The physical volume occupied by one dry RHP1 chain was cal- 
culated to be about 32 nm’, assuming a polymer density of 1.0 g mI". 
Using asphere model for single RHP1 chains, the SANS results lead to 
an average particle size of about 45 nm?. Thus, the RHP1 chains formed 
a unimolecular assembly under the experimental conditions used. 
Fluorescence recovery after photobleaching (FRAP) measurements 
were performed on the liposome surface using RHPI1s with molecular 
weights ranging from -7 kDato ~40 kDa (Fig. 3b). Despite large fluctua- 
tions in the measured diffusion rate (D), the correlation between D 
and the RHP1 molecular weights further excludes extensive polymer 
aggregation in the lipid bilayer. Additionally, in the proton transport 
study, promoted proton transport was observed with RHP1-to-lipid 
ratios increasing from O to 0.0017 (RHP1 concentration, 0-48 nM) 
(Fig. 3c). The trend of the pH increase is reversed at higher RHP1 con- 
centration, probably owing to the formation of RHP1 oligomers that 
were excluded from the lipid bilayer. Together, these results confirmed 
that the monomeric form of RHP1is the active species for the polymer 
concentration window reported here. 

Diffusion of alkali metal cations across a lipid bilayer requires a pore 
size larger than the size of the dehydrated cations, whereas protons can 
either diffuse through pores in the form of water—proton complexes 
or be transported without pores by hopping along HBCs””*. The ionic 
selectivity of RHP1-based transport against other alkali metal cations 
(including Li*, Na*, K*, Rb* and Cs*) was studied using an external cation 
exchange approach” (Fig. 2c). Inthe presence of both protonand cation 
gradients across the bilayer, no distinct co-transport of protons and 
alkali metal ions was observed, which was manifested by minimal pH 
changes in the interior of the liposomes. Such insensitivity indicates 


Segment number per chain 


MMA/EHMA ratio 


numbers are omitted for clarity. d, Transport performance of the RHP variants, 
evaluated using the liposome-based fluorescence assay (Fig. 2b). Inner pH 
changes of the RHP-containing liposomes (RHP:lipid ratio of 0.0017) over 200s 
after adding VIn. Error bars are1s.d. (n=3). 


that RHP1-based transport is highly selective towards protons. This 
is consistent with the proposed HBC-based transport mechanism in 
the transmembrane RHP1 segments and the unimolecular nature of 
RHP1 within the bilayer. 

Animportant hallmark of RHPs is their sequence heterogeneity. Even 
with the same ensemble composition and statistical monomer distri- 
bution, individual RHP chains feature different sequences. However, 
the statistical population and distribution of segments according to 
cumulative hydrophobicity should remain the same. Despite the het- 
erogeneity, RHP1 shows consistent transport performance. This may 
be attributed to the enhanced adaptability that is unavailable in more 
defined systems, and may be advantageous for achieving efficient 
molecular transport. Analyses were performed to gain insight into the 
interplay between differences in monomer sequences and similari- 
ties in the statistical structure of segments (that is, similar statistical 
parameters, which characterize the heterogeneity of segments) and 
how these may affect the RHP1 arrangement in lipid insertion and pro- 
ton transport. Segments rich in MMA and EHMA can be overall hydro- 
phobic while still including a few OEGMA monomers, and sufficiently 
long to span across lipid bilayers. Similarly, there exist segments rich 
in hydrophilic monomers that include a few hydrophobic units, which 
generally prefer the aqueous phase. Both are chemically heterogene- 
ous but with distinctive cumulative hydrophobicities. Within each 
RHPI1 chain, there are also abundant segments that cannot be easily 
assigned as overall hydrophobic or hydrophilic. Rather, their cumula- 
tive chemical features vary substantially depending on the choice of 
segment length and ending position. Nevertheless, they appear to be 
generally amphiphilic and will probably have a short residence time 
in either phase. Their rapid kinetics of insertion and desorption from 
bilayers, instead of a static state, may also facilitate proton transport. 

We hypothesize that the inclusion of asmall amount of OEGMA in 
the lipid bilayer core is a key requirement for the observed proton 
transport. Analysing the hydrophobic segments including OEGMA 
can be done with various criteria over the block length, number of 
OEGMAs included and flanking monomer sequence. As one example, 
Fig. 4a highlights the hydrophobic segments containing one OEGMA 
in a representative RHPI chain. An in-house-developed program was 
used to evaluate the length distribution of such segments in a total of 
4,500 RHPI (degree of polymerization, DP = 130) sequences gener- 
ated by Compositional Drift, considering compositional dispersity 
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(Extended Data Fig. 6a). As seen in Fig. 4b, RHP1 contains the segments 
that fulfil the length requirement to span the lipid bilayers. The core 
of a typical lipid bilayer (-2.7 nm for a POPC bilayer”*) can theoreti- 
cally accommodate 11 MMA-based units in their fully extended form. 
Within a certain length range, shorter and longer segments should 
also be capable of integrating into the bilayer. RHP1 segments and 
surrounding lipids can adjust their chain conformation and/or bilayer 
thickness, respectively, to adapt to hydrophobic mismatch. Further 
analysis, shown in Fig. 4c, suggests that there are a limited number 
of such segments per RHP1 chain (DP = 130). For example, an average 
of three segments per polymer chain was counted with a segmental 
length of =9, and two segments per chain with length 211. RHP1 length 
dispersity was also considered in the segment analyses. The statisti- 
cal distribution of the segment size is less affected by the polymer 
length variation once the RHP1Lis sufficiently long (DP > 40) (Extended 
Data Fig. 6b, c), although the segment numbers per chain have a posi- 
tive correlation with the RHP1 chain length (Extended Data Fig. 6d). 
We also used the probabilistic framework of the hidden Markov 
model (HMM) to predict the transmembrane segments of RHP1 
(Extended Data Fig. 7). The HMM investigation agrees with the afore- 
mentioned sequence analysis, indicating that RHP1 chains are richin 
transmembrane segments and include those that allow OEGMAs to 
embed in the bilayer core to facilitate proton transport. Such hydro- 
phobic segments are limited in their population for the molecular 
weight range studied here, thus forming a narrow crossing path for 
proton transport. 

According to the information gained, segmental heterogeneity 
afforded by four distinct monomers appears to be a key design fea- 
ture that is necessary to capture several critical features of natural 
channels with regards to proton transport. Experimentally, none of 
the three-monomer-based RHPs shows efficient transport activity 
(Extended Data Fig. 3b). It is reasonable to believe that a statistical 
design approach should also lead to other RHPs with proton trans- 
port properties. We synthesized a series of RHP variants, varying the 
MMA:EHMA ratio from 3.5:3.5 to 7:0. The overall fraction of these two 
hydrophobic monomers remains the same in order to maintain similar 
segmental distributions. The RHPs within the compositional range of 
4:3 to 6:1 MMA:EHMA could promote proton transport, albeit with 
different performances (Fig. 4d). RHP1 (MMA:EHMA = 5:2) elicits the 
best proton permeation, whereas either increasing or decreasing the 
portion of EHMA reduces the overall transport performance. Shifting 
the MMA:EHMA ratio should alter the capability of polymer insertion 
into lipid bilayers by changing the overall polymer hydrophobicity/ 
hydrophilicity and the balance between polymer-polymer and poly- 
mer-lipid interactions. Decreasing the MMA:EHMA ratio increases the 
occurrence of EHMA monomers distributed adjacent to each other. 
With a bulky ethyl hexyl side-chain, local abundance of EHMA would 
also sterically hinder the interplay between the polymer carbonyl and 
water/OEGMA, affecting the proton transport efficiency (Extended 
Data Fig. 6e, 6f). 

The present study confirms that exploring and manipulating the 
segmental heterogeneity in RHPs at the single-chain level is feasible and 
effective to mimic protein-like behaviour. Considering the vast number 
of monomers available and recent advances in polymer chemistry, the 
possibility to generate RHP-based functional materials“*”*~’ and even- 
tually marry the synthetic and biological fields are almost unlimited. 
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Methods 


Materials 

Chemicalswere purchased from SigmaAldrichand Fisher Scientificunless 
otherwise noted. Phospholipids were purchased from Avanti Polar Lipids 
Inc., including 1-palmitoyl-2-oleoyl-sn-glycero-3-phosphoethanolamine 
(POPE), 1-palmitoyl-2-oleoyl-sn-glycero-3-phospho-(1’-rac-glycerol) 
(sodium salt) (POPG), 1-palmitoyl-2-oleoyl-glycero-3-phosphocholine 
(POPC), 1,2-dimyristoyl-sn-glycero-3-phosphocholine (DMPC) and 
1,2-dimyristoyl-d54-sn-glycero-3-phosphocholine (d54-DMPC). Water 
was Milli-Q-grade water (18.2 MO.cm). 


RHP syntheses 

RHPs were synthesized as described previously. Azobisisobutyroni- 
trile was recrystallized from ethanol. Inhibitors were removed using 
cryodistillation (methyl methacrylate, 2-ethylhexyl methacrylate) or 
by passing over a column of neutral alumina (ethylene glycol methyl 
ether methacrylate (M, = 500 g mol). 3-sulfopropyl methacrylate 
potassium salt (98%), ethyl-2(phenylcarbanothioylthio)-2-phenylac- 
etate (98%), trioxane and solvents were used as received. Polymeriza- 
tion solutions were prepared and transferred to 20-ml glass ampules. 
These solutions were subjected to four freeze-pump-thaw cycles 
and then flame-sealed at ~50 mtorr. Polymerizations were carried 
out at 80 °C for typically 6-12 h. The ampules were then quenched in 
liquid nitrogen and opened. Polymers were precipitated by dropwise 
addition to stirring pentane and then transferred to 3,500 molecular- 
weight cut-off centrifuge filters (Amicon), washed five times using 
water and lyophilized. 

'H nuclear magnetic resonance (NMR) spectra were obtained with 
either a Bruker Avance 400 spectrometer (400 MHz) using a 5-mm 
Z-gradient broadband observe probe, a Bruker Avance AV 500 spec- 
trometer (500 MHz) using aZ-gradient triple broadband inverse detec- 
tion probe, or a Bruker Avance AV 600 spectrometer (600 MHz) using 
a Z-gradient triple broadband inverse detection probe. Monomer 
conversion was measured on crude reaction mixtures in DMSO-d,, 
using trioxane as an internal standard. The RHPs were characterized 
using gel permeation chromatography with an Agilent 1260 Infin- 
ity series instrument equipped with two Agilent PolyPore columns 
(300 mmx 7.5 mm), calibrated using poly(ethylene glycol) standards. 
Dimethylformamide with 0.05 M LiBr was used as eluent at 0.7 ml min 
at 50 °C. Analyte samples at 2 mg ml7™ were filtered through 0.2-um 
polytetrafluoroethylene membranes (VWR) before injection (20 pl). 
The physiochemical properties of the synthesized RHPs are listed in 
Extended Data Table 1. 


Preparation of large unilamellar vesicles 

2.5mg of POPC ora mixture of POPE and POPG (3:1 molar ratio) was dis- 
solved in chloroform and subjected to solvent removal to make a lipid 
film, which was dried in vacuum overnight. The lipids were rehydrated 
by a0.5 ml inner buffer (10 mM HEPES-KOH, pH 6.9, 100 mM KC]) at 
37 °C for 1h. 0.5 mM pyranine was included in the inner buffer in the 
pyranine assay. The resulting suspension was subject to six cycles of 
freeze-thawing using a dry ice—-ethanol bath and a 37 °C water bath. 
Large unilamellar vesicles (LUVs) were generated by extruding 11 times 
through a 400-nm polycarbonate membrane followed by a200-nm 
polycarbonate membrane. Free dye was removed using a Sephadex 
G-25 desalting column (GE healthcare). LUV formation was confirmed 
using dynamic light scattering and transmission electron microscopy. 
The LUVs composed of POPE+POPG and of POPC only had diameters 
of 182 +16 nm and 189 + 9 nm, respectively. The lipids were quantified 
using the Stewart method”. 


Preparation of giant unilamellar vesicles 
Giant unilamellar vesicles (GUVs) for microscopy and FRAP inves- 
tigations were prepared using a reported agarose-based hydration 


method”. 300 ul of 1% melted agarose solution (low melt; gelling 
temperature, 7, ~ 26 °C; melting temperature, 7, < 65 °C; electroen- 
dosmosis, EEO < 0.12) was added to a cover glass (22 mm x 22 mm) 
and dried ona hotplate. 80 pg POPC in chloroform was evenly spread 
onthe agarose film. The chloroform was removed under vacuum. The 
film was covered by a 1-ml HEPES buffer (10 mM HEPES-KOH, pH 6.9) 
to induce GUV growth for 0.5 h. Liposomes were released by gentle 
pipetting and immobilized into a thin layer of 0.5% agarose ona glass 
slide for microscopy investigations. 


Interactions between RHPs and liposomes 

Confocal fluorescence imaging. RHPs were end-modified with a 
Texas Red fluorescent dye via thiol-maleimide chemistry. GUVs doped 
with NBD-PE (100:1 molar ratio) were incubated with the Texas Red- 
labelled RHP1s (2,000:1 molar ratio) for 0.5 h. Unbound polymers 
were rinsed off right before imaging. The imaging was performed ona 
Zeiss LSM 880 witha 63x (numerical aperture, NA, 1.4) oil or 20x (NA, 
1.0) water-immersion objectives and one Airy unit. Lasers at 488 nm 
and 594 nm were employed to excite the NBD and Texas Red, respec- 
tively. Fluorescence signals were collected in the 500-550 nm and 
605-655 nm bands. 


DSC investigation. DSC thermograms probing the lipid phase transi- 
tion were acquired using a VP-MicroCal calorimeter (GE). The samples 
and buffers were equilibrated at 5 °C for 30 min and heated from 5 °C 
to 40 °C at arate of 1°C min“. The heating and cooling process was 
repeated for several cycles. The buffer or the RHP-containing buffers 
were used as the background. The full-width at half-maximum of the 
peak was obtained with baseline correction using the Origin software 
(Microcal). 


FRAP measurement. FRAP experiments were conducted using Texas 
Red-labelled RHP1 containing-GUVs. Five prebleach images were taken 
at an attenuated laser intensity (0.5%, 561 nm). Subsequently, pho- 
tobleaching was performed through a circular region of interest of 
nominal radius 3 pm and 20 bleaching iterations using the 514-nm and 
561-nm laser lines at maximum intensity. After bleaching, recovery was 
monitored from the subsequent 95 frames with the excitation switched 
back to the attenuated intensity. A background scan and bleaching 
under the same conditions were conducted for FRAP curve correc- 
tion. The recovery curves were fitted and the diffusion coefficient was 
obtained as reported previously*°. 


Molecular dynamics simulation 

All-atom explicit solvent molecular dynamics simulation. The 
simulations were performed using GROMACS (5.0.7)*! with the 
CHARMM 36m force field”? and the recommended TIP3P water 
model, the structure of which was constrained using the SETTLE 
algorithm***. RHP1 sequences (DP = 40) were created by varying 
the random seed with fixed component ratios. The initially extended 
RHPI chains were first equilibrated in vacuum for 0.1 ns until they 
collapsed. Each RHP1 was then embedded at the centre of a bilayer 
with 120 POPC lipids. 7,200 water molecules were then added with 
approximately 0.1M KCI. The initial structure (Extended Data Fig. 2d) 
was created using Packmol”. This process mimics the experimental 
procedures used to promote RHP1 insertion by premixing RHP1 and 
lipid and the freeze-thawing process used to create transient pores 
onthe membranes. 

The potential energy was minimized, followed by equilibration 
using the NPT ensemble (constant particle number, pressure and tem- 
perature). To relax the lipid bilayer, the Z-dimensional coordinates of 
all of the POPC atoms were constrained for 0.1 ns using a force con- 
stant of 1,000 kJ mol ‘nm ~, followed by an exclusive constraint of the 
POPC phosphorous atoms for 0.1 ns. The equilibration simulation 
was performed at 310.15 K for 300 ns. Inthe production simulation at 
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298.15 K, the Nose-Hoover algorithm was employed to separately cou- 
ple the temperatures of the polymer-POPC complex and the others. 
Semiisotropic pressure coupling (Pyy=P,=1bar; time constant, 5.0 ps; 
compressibility 4.5 x 10° bar”) was applied via the Parrinello-Rah- 
man algorithm”. The neighbour search was performed up to 1.2 nm. 
The short-range Coulomb interactions were calculated up to 1.2nm 
using the particle mesh Ewald method*®” for the long-range electro- 
static interactions. The Lennard-Jones 12-6 potential was switched 
off from 0.8 nm to 1.2 nm with the force switching method. These 
parameters have been recommended for the accurate reproduction 
of the original CHARMM simulations on lipid membrane systems*°. 
A simulation integration time step of 2 fs was employed, with all the 
hydrogen-involved covalent bond lengths constrained. Each produc- 
tion simulation was performed for 500 ns, with the final configuration 
presented in Extended Data Fig. 2d. The convergence of the simulations 
was confirmed by the area per lipid and the polymer-lipid interac- 
tions (Extended Data Fig. 2e). The HBCs were visualized using the 
package VMD*!, 

Nine parallel simulations were performed with nine different poly- 
mer chains (Extended Data Fig. 2a). Five chains were found to span 
the bilayer roughly symmetrically (Fig. le, Extended Data Fig. 2b). The 
other four chains were found near the membrane-water interfaces. We 
believe that multiple factors influence the insertion of RHP1, including 
the overall monomer distributions along the RHP1 chain, the chain 
length and the transient RHP1 conformations. The sequence analysis 
showed that the average number of hydrophobic segments per RHP1 
chain increases with longer chain lengths. For relatively short RHP1 
chains (DP = 40; green line in Extended Data Fig. 6d), itis not surprising 
that not all of the nine different chains span through the lipid bilayer 
in these simulations. 


Proton transport on LUVs 

Transport on LUVs. With excitation at both 460 nm and 415 nm, the 
ratio of pyranine emission (/) at 510 nM (/.,460//ex415) WaS Monitored to 
probe the inner pH change over time. The tested fluorescence ratios 
of pyranine in the lumen of the LUVs were linearly proportional to 
DH values in the range 6.4-7.4. Purified LUV solutions were diluted 
ten times into the 1-ml sample with an outside buffer (10 mM HEPES- 
KOH pH 7.7, 100 mM KCl). Proper amounts of the RHP solutions at 
0.1 or 1.0 mg mI" were added into the LUV solutions with continu- 
ous stirring. After incubation for 5 min, /,¢o//,;; was monitored over 
500 s. 3 pl of 10 uM Vin in ethanol was added at 100 s to trigger the 
proton flux. To evaluate the ion selectivity of RHP1, purified solutions 
of RHP1-containing LUV were diluted ten times in1 ml of outside buffers 
containing one type of alkali metal cation (10 mM HEPES-KOH pH 7.7, 
100 mM MCI, M = Li, Na, K, Rb, Cs). Without Vin, changes in the inner 
pH were recorded over 200 s. The fluorescent measurements were 
performed ona Perkin Elmer KS55 fluorescence spectrometer and a 
micro-cuvette under constant stirring. 


Transport investigation on stopped-flow fluorimeter 

Proton transport rate. RHP1 and the lipids were premixed in chloro- 
form before LUV preparation. Purified RHP1-containing LUV solu- 
tions were diluted 33 times with the inner solution and mixed with 
30 nM Vin. 15 pl of the RHP1-VIn-LUV mixture in the inner solution 
was rapidly exposed to 15 pl of the outside solution using the AutoSF 
120 stopped-flow fluorimeter (KinTek) at a flow rate of 8 mI min™. With 
excitation at 460 nm, the emission of the encapsulated pyranine was 
monitored as a function of time using a 520 nm/20 nm fluorescence 
filter. To evaluate the isotope effect of the proton transport, a similar 
proton transport experiment was performed by substituting D,O for 
HO, including in the LUV preparation and purification processes. A 
deuterated base was used to titrate the pD. The pD value was meas- 
ured using a glass pH electrode and corrected by adding 0.4 to the 
pH readout. The isotope effect value was obtained from the ratio of 


the conductance in the H,O assay to that in the D,O-based assay. The 
initial pH change rate was fitted to obtain the conductance of the 
channels or RHP1 and the permeability of membranes, using equa- 
tions reported previously”. 


Water transport. LUVs were prepared ina similar way as for the proton 
transport assay, using 20 mM HEPES-KOH, pH 7.5, 100 mM KCl contain- 
ing 10 mM carboxyfluorescein. The LUVs were purified using the same 
buffer without the dye. Using stopped-flow fluorescence at 7 °C, the 
channel- or RHP1-containing LUVs in this inner solution were rapidly 
exposed to an outside solution (20 mM HEPES-KOH, pH 7.5, 1OO mM 
KCI, 100 mM sucrose) and subjected to a fluorescence measurement 
with excitation at 490 nm and the 520 nm/20 nm filter. The relative 
volume was linearly related to the relative fluorescence at the pre- 
sent conditions. The fluorescence quenching curves for the shrinking 
liposomes were fitted with a single-exponential function. The osmotic 
water permeability was calculated as described previously’. 


RHP conformation in liposomes 

SANS investigation. SANS measurements were conducted at the 
GP-SANS instrument of the High Flux Isotope Reactor at Oak Ridge 
National Laboratory“ and at the NG7 instrument of the National Insti- 
tute of Standards and Technology. The sample-to-detector distances 
were set to generate a range of momentum transfer, g, from 0.004 to 
0.6 A+, where g=4msin@/A, where 26 denotes the scattering angle from 
the incident beam. The wavelength Ais 5 A for NG7 and 4.75 A for GP- 
SANS. Small unilamellar vesicles (SUVs) composed of d54-DMPC were 
prepared as described above, using 86 vol% D,O in H,O and a100-nm 
membrane. The SLD of the 86% D,O theoretically matches the average 
SLD of d54-DMPC. Although 100% contrast match cannot be achieved 
owing to the deuterated lipid tail and H-based lipid head, using 86% D,O 
dampens the SANS signals from the liposome. The SANS measurements 
were performed at 37 °C in 2-mm-path-length quartz cells using solu- 
tions of the SUVs. The 2D scattering results were azimuthally averaged 
to produce one-dimensional SANS curves. The data were modelled 
using a three-shell vesicle model for the RHP1-free SUVs and a sphere 
model for the free RHP1. Asum of these two models was used to fit the 
SANS profiles of the SUV-RHP1 samples. All data fitting was performed 
using the SASView program (version 4.2.0, http://www.sasview.org/). 


Sequence analyses 

HLB. The HLB value was used to evaluate the solubility of monomer 
side-chains through group contribution theory. Using the equation 
HLB =7 + ¥; n,HLB, where n;is the number of the ith chemical groupin 
the molecule with corresponding value HLB;. The HLB value for each 
monomer side chain was estimated as: HLB(MMA) = 8.45, 
HLB(EHMA) = 5.12, HLB(OEGMA) = 11.4 and HLB(SPMA) = 18.5. A lower 
HLB value denotes higher hydrophobicity and a higher value means 
greater hydrophilicity. A Python program was created to continu- 
ously calculate the average segmental HLB values for a window sliding 
from the alpha to the omega ends of the simulated RHP1 chains. The 
window advanced by one monomer each time. We used a span contain- 
ing odd numbers of monomers, and assigned the average HLB value of 
that span to its middle monomer. For instance, for the window of 11 
monomers shown in Fig. 1b, the average HLB value of the span from 
monomer 1to monomer 11 was calculated and plotted at monomer 
position 6. The next HLB value, corresponding to the span from mon- 
omer 2to monomer 12, was plotted at monomer position 7. 


Hydrophobic segments. A Python program was used to analyse the 
length and number of hydrophobic segments containing one OEGMA 
inthe RHPs. The hydrophobic embedded OEGMA was set to be two or 
more monomers away from the ends of the segments. The number of 
uninterrupted hydrophobic monomers on both sides of an OEGMA 
was counted. 


Segment prediction 

HMM prediction. The HMM has a successful history in describing la- 
tent statistical structures and incorporating various constraints, de- 
spite its limitation in duration modelling. Given an observable output 
sequence of length 7 (for example, y = (),, yo, ..., 7), the basic goal for 
an HMM ‘* is to infer the corresponding hidden state sequence of the 
same length, g = (q,, qd, .--. 97). In our study, y, denotes the monomers 
and q, represents the states defined above: state 1, state 2 and state 3. 
The HMM parameters include: 

1. The initial probabilities of each state, A = {7}, where 7, = P(g, =i) 
and iis the index for the three different hidden states (that is, i=1, 2, 3). 

2. The transition probability matrix A = {a,}, where a,;=P(9.1=/1g,=0), 
that is, the probability of next step being in state /(j=1, 2, 3) ifcurrently 
itis in state i. 

3. The emission probability matrix B = {b,} is defined as 
b= P(),=Jl9,= i), where/ is the index for the four different types of 
monomers (thatis, /=1, 2, 3, 4). 

Then the joint probability of states (that is, g = (q,, do, .... 97))) and 
observed outputs (that is, y=(y,, y>, ...,7)) can be written as: 


T-1 
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The Baum-Welch algorithm” is typically used to find the unknown 
parameters A, A and B, and the Viterbi algorithm” is used to determine 
the optimal state path for each observed sequence. 

The unobserved states represent the three possible regions on RHP1s 
that prefer water, the lipid-water interface, or the core of lipid bilay- 
ers, and states 1, 2 and 3 denote the corresponding hidden states. We 
implemented a third-order HMM in which (i) the transitions between 
states were modelled by a third-order Markov chain and (ii) third- 
order emission probabilities were used to characterize the monomer 
preferences in different RHP1 regions and to model the dependencies 
between neighbouring monomers-that is, the probability of generat- 
ing amonomer ata position would depend not only onthe hidden state 
at that position, but also on the states at the previous two positions. 
Additionally, we incorporated the following rules in the HMM to bet- 
ter approximate reality and facilitate parameter inference (Extended 
Data Fig. 7b): 

1. Despite the ambiguous criteria for defining a state-3 segment, we 
began with segments full of hydrophobic monomers and hydrophobic 
segments containing one OEGMA. 

2. To differentiate between OEGMAs included in a state-3 segment 
and those adjacent to such segments, the OEGMA included was set to 
be at least two monomers away from the ends of the segments. 

3. Owing to the high polarity, SPMA was fixed to appear only in state 
linthe current setup. 

4. Hydrophobic segments (states 2 and 3) end with hydrophilic 
monomers (OEGMA and/or SPMA). Those boundary monomers, if 
notincludedinan extended state-3 segment (that is, the OEGMA within 
along hydrophobic segment), were considered to be in state 1. We 
note that when a length constraint for state 3 has to be incorporated 
(as shown in Extended Data Fig. 7b), the design of the HMM would 
be more challenging and the HMM might not perfectly represent the 
observed data distribution. 


Proton transport on SLBs 

Transport on an SLB. SUVs were prepared in a similar way to the 
LUVs, except for using a 100-nm extrusion membrane. 150 pl of 
2.5 mg ml SUVs was deposited ona clean Pd contact and incubat- 
ed overnight with mild agitation to facilitate formation of the SLB. 
Unfused SUVs were washed off using the liposome buffer. The SLB 
formation was verified by liquid atomic force microscopy charac- 
terization (Extended Data Fig. 8a). The fresh SLB was covered with 


150 ul of 0.05 mg mI RHP1 solution for 30 min. The RHP1-SLB so- 
lution was subjected to a liposome buffer rinse to remove free poly- 
mers. Current-voltage and cyclic voltammetry measurements were 
performed on a home-built system (Extended Data Fig. 8b), and 
the membrane permeability was fitted using a model described 
previously”. 


Data availability 


The data supporting the findings of this study, as well as descriptions 
of the methodologies used in the code, are available within the article 
and the Extended Data items. For reproduction purposes, the raw data 
used to generate the figures and input scripts used to produce the 
simulations are available from the Dryad Digital Repository (https:// 
doi.org/10.6078/D1VXOB). 


Code availability 


All custom scripts (sequence analyses and HMM) are available from 
the corresponding author upon request. 
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Extended Data Fig. 1| Interactions between RHPs and liposomes. a, DSC 
profiles of LUVs and RHP1- and RHP1,100-containing LUVs (POPE:POPG molar 
ratio, 3:1). LUVs were grown with RHP1 or RHP1,100 with a lipid-to-RHP molar 
ratio of 50:1anda lipid concentration of l1mg mL". The background signals of 
the buffer and the polymers were subtracted. No phase transition was 
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observed for RHP itself. The plots are shifted vertically for clarity by +O0.0001 
for RHP1,100 and +0.0002 for RHP1. b, Full-width at half-maximum of the 
lipid phase transition peaks shownina. Error bars are1s.d. (n=3). 

c,d, Representative confocal fluorescence image of POPC GUVs incubated 
with Texas Red-labelled RHP1 (c) and Texas Red-labelled RHP1,100 (d). 
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a 
b 2 
RHP1 chains monomer sequences 
1 1111411421123112111121213111132332323322 
2 3141221233112332221312121331111111112114 
3 1312123412323314311211211131112131111222 
4 1323122231311331111122131211111312121244 
5 3111111121121212342211411112312112323333 
6 4133221323321222212111113121131111114311 
7 3132123122112132224232114331131111111111 
8 1211311123231113412132122211233112114113 
9 1112111221131111312231111221231231234334 
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Extended Data Fig. 2 | All-atom molecular dynamics simulations. dashed lines. Lipids are omitted for clarity. d, Snapshots of the initial (left) 
a, Sequences of the nine RHP1 chains used in the simulations. b, Snapshot of and final (right) RHP1 conformations inthe simulations. The RHP1 chain is 
the 2nd-Sth RHP1 chains in the POPC lipid bilayer in the simulations. MMA, highlighted, along with K* (orange) and CI (green) ions. e, Convergence of the 
EHMA, OEGMA and SPMAare coloured red, pink, blue and purple, respectively. | simulations. Top, area per lipid as a function of the simulation time. Bottom, 
Lipids are shown in grey colour. c, Representative HBCs in the transmembrane Coulomb and Lennard-Jones interactions between POPC lipids and the 1st-Sth 
regions of RHP1s. The oxygen and hydrogen atoms of water molecules are RHPIs asa function of simulation time. Different colours denote the results 


coloured red and white, respectively. Hydrogen bonds are shown with red from five parallel molecular dynamics simulations. 
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Extended Data Fig. 3 | Transmembrane proton transport. a, pH gradient- 
driven proton flux through RHP1in the LUVs of POPE/POPG (molar ratio 3:1). 
b, Inner pH changes in POPC LUVs incubated with RHP1 and RHP variants over 
200s after adding VIn. ‘RHPwoM’, ‘RHPwoSP’ and ‘RHP7/0’ represent the RHP 
without MMA, the RHP without SPMA and the RHP without EHMA, 
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respectively. c, pH gradient-driven proton flux in the influx and outflux 
directions. RHP1 was added to the solutions of preformed LUVs. d, pH gradient- 
driven proton flux through RHP1in DMPC SUVs at 37 °C. RHP1-containing SUVs 
were prepared as indicated in the Methods section ‘SANS investigation’ using 
H,O-based buffers, with an RHP1:DMPC ratio of 0.001. 
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Extended Data Fig. 4 | Proton and water transport rates. a, Representative 
stopped-flow fluorescence traces of RHP1-containing POPC LUVs after rapid 
exposure to a buffer of a higher pH value. The initial and final equilibrated pH 
values are 6.85 and 7.35, respectively. VIn was added before the measurements. 
b, Calculated permeability values of RHP1-containing LUVs at various RHP1-to- 
lipid ratios. Error bars are1s.d. (n=3).c,d, Conductance (c) and initial proton 
transport rate (d) per RHP1 chain at various RHP1-to-lipid ratios. Error bars are 
1s.d. (n=3).e, Stopped-flow fluorescence traces of RHP1-containing LUVs after 
rapid exposure toa buffer of a higher pH or pD value. The initial and final 
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equilibrated pH values are 6.85 and 7.35, respectively, for the H,O-based assay. 
The initial and final pD values are 7.25 and 7.75, respectively, for the D,O-based 
assay. VIn was added to the RHP1-containing LUV solutions, except for the 
GramA-containing LUVs. f, Osmotic water permeability of VIn, GramA and 
RHP1+VIn in POPC LUVsat 7 °C. Vin (K* channel) and GramA (proton/cations/ 
water channel) are the negative and positive controls, respectively. GramA, VIn 
and RHP1 were added to the solutions of preformed LUVs. ‘Premixed’ denotes 
premixing of RHP1and the lipid before the preparation of LUVs. Error bars are 
1s.d.(n=7). 
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Extended Data Fig. 5 | SANS investigations. a, SANS profiles of free RHP1in 
86% D,O. The solid line denotes a fit using a polydisperse sphere model for 
RHP1. The signal of the solvent has been subtracted in the presented curves. 
b, Schematic illustration of the three-shell vesicle model for the SUV, showing 
the regions of the liposome core (core), inner bilayer headgroup (shell 1), lipid 
bilayer core (shell 2) and outer bilayer headgroup (shell 3).c, Structural 
parameters obtained by fitting the SANS data with the three-shell vesicle 
model for the liposome and the polydisperse sphere model for RHP1. 


Parameters that were fixed during the fitting are denoted with ‘f’; RMS 
represents root mean square. SLDs that were fixed were determined fromthe 
known sample composition, including when RHP1 was present in the bilayer, 
because the concentration was too lowto impact the SLD value of shells 
significantly. The SLD values used for the liposome core, for shells 1, 2,3 and for 
the solvent are 5.41, 1.88, 6.84, 1.88 and 5.41 10° A”, respectively. The SLD 
value for RHP1is 1.4 x10~° A “in the sphere model. 
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Extended Data Fig. 6 | Sequence analyses of RHPs. a, Statistical distribution 
of number of OEGMA monomers per RHPI chains in the 4,500 simulated RHP1 


sequences (DP =130). This distribution reflects the compositional dispersity in 


RHP1I chains. b, Statistical size distribution of hydrophobic segments with one 
OEGMA in RHPIs of various chain lengths ranging from DP = 40 to 200. The 


number of chains (4,500) was kept constant for each DP analysis. c, Normalized 


profiles of the size distributionin b. d, Number of hydrophobic segments per 
chain at various chain lengths. The counted hydrophobic segments here 
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dynamics simulation. EHMA, MMA and OEGMAare coloured pink, red and blue, 
respectively. Lipids are omitted for clarity. 


Segment number per chain 


Segment size 


tate-3 segment that must emit OEGMA with probability 1 (denoted as 


Inas 


1).c¢, Statistical size distribution of the state-3 segments predicted by 


HMM in4,500 RHP1sequences. d, Number of the predicted state-3 segments 


‘prob. 


per polymer chain. e, Statistical size distribution of predicted state-3 segments 


containing one OEGMA. f, Number of predicted state-3 segments conta 


ining 


one OEGMA per polymer chain. 4,375 out of 4,500 RHP1 chains contain at least 


one state-3 segment. 


top) assigned using the method described in Methods 


, 


a_ hydrophobic segments 


° ° 
* —MWnN 
“ Reais ee 
: _fIlltTl L , 
‘o 
2 
7 5 
wy 5 
7 a Le 
~ 5 
z 3 
o — 
5 
. Fi 
Ld 5 
"6 5 
= e 
5 
- 3 
= 3 
r T T o 
es f=} °o oO 
iM 8 3 
& om So 
: + a 
oe a @0U8UND9Q, e0UeIN99Q, 
e TrMe rss me} - 
oe na i=} f=} 
Rane 7 8 8 
bal “ = wo wo 
a . a a 
a it « ° 2° 
a Ro & 
g 
oe 8 
oo n ~ 4 5 jet) 
a a * =I 
6 
oon e 5 
ole 7 a8 s 
A 5 i 5 
es eecrere ia T T ° T T tO 
mR ae a 3 o 3 3 BS o 
8 8 8 8 8 
i. aes + S co S + aq 
i pan meen A a s 2 
is =e 5 90uaNIIOQ 90UaNDIQ 
_ ° 
z Mee —~ 
a + ) 
~ on ieee 
ie aeee pacha SY 
“ aes Hee Hei Ney 
mou ed ee eres ae 
ee ieoeess lan: (3 
Vac afle Hw ofa cee 2 
oowafancapliacce Ce) 
ee pee ae (8) 
eral ees ee ay 
aren Bee Ml la 3 ) 
S aq ce ee oo 
a ee ht eee a : 
RE: C(s 
ee Pee oe ee 
@ wofoces espe 
@ Soon sasoapl ow 
Caw eer Ane dB 
De Neer eee Nee 
@ BB ssccwc caw NN 
a oo AMONAK ME MOOR Uy 
eee «= ebeignanniniornianes & 
@ LoRwor carer oone 
© pancaansaccecne 
D annnrarannr ar an 
Ciitinaentaeees 9 


section ‘Hydrophobic segments’ and those predicted by HMM (bottom).‘1’, ‘2’, 


‘3’and ‘4’ denote MMA, OEGMA, EHMA and SPMA, respectively. b, Transition 
diagram witha state-3 segment of size >7. States 1and 2 are denoted as Sl and 


S2, respectively. State 3 here is broken into 10 states to comply with the rules 
outlined in Methods section‘HMM prediction’. S5 and S9 are the special states 


Extended Data Fig. 7 | HMM prediction. a, Comparison between state-3 


segments (size >7 
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5.61nm.d, Current-voltage sweep of the protonic device. The Pd/SLB contact 


Extended Data Table 1| Physical-chemical properties of RHPs 


MMA EHMA OEGMA SPMA Mn(kg/mol)* Min(kg/mol)+ Di 

RHP1 50 (50.8) 20 (21.5) 25 (24.1) 5 (3.7) 306 29.0 13 
RHP70+0 70 (72.1) 0 (0) 25 (23.1) 5 (4.8) 18.0 20.2 14 
RHP60+10 60 (61.9) 10 (10.0) 25 (23.5) 5 (4.6) 185 21.0 1.3 
RHP55+15 55 (51.9) 15 (12.2) 25 (33.3) 5 (2.5) 28.0 33.6 1.6 
RHP40+30 40 (40.4) 30 (30.3) 25 (24.3) 5 (5.0) 26.5 22.0 1.3 
RHP35+35 35 (36.1) 35 (36.4) 25 (23.8) 5 (3.8) 40.0 51.1 1.3 
RHPwoSP 53 21 26 0 29.2 44.4 1.3 
RHPwoM 0 40 50 10 35.6 45.9 1.2 
RHP1-7k 50 20 25 5 7.0 16.3 1.2 
RHP1-19k 50 20 25 5 18.7 18.8 14 
RHP 1-40k 50 20 25 5 40.6 40.2 1.3 
RHP 1,100 50 20 25 5 57.8 1.2 


*Determined by NMR. 


tMeasured by gel permeation chromatography, using PEG standards for calibration. 


+Numbers in parentheses are the monomer compositions determined by °C NMR (150 MHz). 
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Climate change in the Arctic is occurring rapidly, and projections suggest the 
complete loss of summer sea ice by the middle of this century’. The sensitivity of 
permanently frozen ground (permafrost) in the Northern Hemisphere to warming is 
less clear, and its long-term trends are harder to monitor than those of sea ice. Here we 
use palaeoclimate data to show that Siberian permafrost is robust to warming when 
Arctic sea ice is present, but vulnerable when it is absent. Uranium-—lead chronology of 
carbonate deposits (speleothems) in a Siberian cave located at the southern edge of 
continuous permafrost reveals periods in which the overlying ground was not 
permanently frozen. The speleothem record starts 1.5 million years ago (Ma), atime 
when greater equator-to-pole heat transport led to a warmer Northern Hemisphere’. 
The growth of the speleothems indicates that permafrost at the cave site was absent at 
that time, becoming more frequent from about 1.35 Ma, as the Northern Hemisphere 
cooled, and permanent after about 0.4 Ma. This history mirrors that of year-round sea 


ice in the Arctic Ocean, which was largely absent before about 0.4 Ma (ref. *), but 
continuously present since that date. The robustness of permafrost when sea ice is 
present, as well as the increased permafrost vulnerability when sea ice is absent, can 
be explained by changes in both heat and moisture transport. Reduced sea ice may 
contribute to warming of Arctic air* °, which can lead to warming far inland’. Open 
Arctic waters also increase the source of moisture and increase autumn snowfall over 
Siberia, insulating the ground from low winter temperatures® °. These processes 
explain the relationship between an ice-free Arctic and permafrost thawing before 
0.4 Ma. If these processes continue during modern climate change, future loss of 
summer Arctic sea ice will accelerate the thawing of Siberian permafrost. 


Arctic Ocean sea ice has declined increasingly rapidly in recent decades, 
with progressive ice thinning and increasing areas of open water during 
the summer". Complete loss of summer sea ice is expected by the mid- 
twenty-first century’. The recent loss of Arctic sea ice raises concerns 
about its effects on other aspects of the global climate system, including 
potential acceleration of permafrost thawing’. Permafrost degradation 
as aresult of anthropogenic global warming could amplify climate 
change by releasing large volumes of carbon stored in permafrost in 
the form of carbon dioxide and methane”. In addition, permafrost 
thawing increases thermokarst development, coastal erosion and liq- 
uefaction of ground previously cemented by ground ice, endangering 
infrastructures relying on permafrost as solid ground”. Establishing 
the relationship between loss of Arctic seaice and permafrost response 
is therefore an important goal. 

Understanding of the response of permafrost to changing climate 
can be improved with knowledge of past environmental conditions. 
Precisely dated growth periods of speleothems (stalagmites, stalac- 
tites and flowstones) from caves located in permafrost regions have 
proved an effective tool for the reconstruction of past permafrost 


extent and continuity”. Speleothems grow only when meteoric waters 
seep through the vadose zone into caves. When the temperature in the 
upper vadose zone falls below O °C throughout the year, water freezes, 
infiltration stops and speleothem growth ceases. Speleothems found in 
modern permafrost regions are thus relicts from warmer periods when 
permafrost was absent, or when permafrost thawed temporarily». 
Dating of these relicts allows a comparison of periods of permafrost 
absence to other aspects of the past environment. 

Here, we reconstruct permafrost dynamics in central Eastern Siberia 
over the past 1,500 thousand years, using uranium-lead (U-Pb)-dated 
speleothems from Ledyanaya Lenskaya Cave (60° 22’ 15.60” N-116° 56’ 
47.30” E) (Fig. 1) located in the zone of present-day continuous per- 
mafrost (that is, year-round frozen ground across the whole region"*; 
Extended Data Fig. 3). The study area is characterized by cold continen- 
tal climate (Dfc according to the K6ppen classification”), with mean 
July and January air temperatures of +18 °C and —32 °C, respectively, a 
mean annual air temperature (MAAT) of about —6 °C (Extended Data 
Fig. 2), and annual precipitation of about 400 mm. The record from 
Ledyanaya Lenskaya Cave is compared with data from Botovskaya Cave 


‘Geological Survey of Israel, Jerusalem, Israel. Department of Earth Sciences, University of Oxford, Oxford, UK. Department of Geography and Environmental Sciences, Northumbria 
University, Newcastle upon Tyne, UK. “Institute of Earth’s Crust, Russian Academy of Sciences, Siberian Branch, Irkutsk, Russia. °Speleoclub Arabica, Irkutsk, Russia. *e-mail: antonv@gsi.gov.il 
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Fig. 1| Study area and permafrost maps. a, Permafrost map of northern 
Eurasia with the study area marked by the black rectangle, continuous 
permafrost in purple, discontinuous permafrost in green and the area with no 
permafrost in yellow. b, Extent of permafrost types in eastern Siberia’ and the 
location of Ledyanaya Lenskaya, Botovskaya and Okhotnichya caves, marked 
by black stars. Cities and towns are marked by grey circles. Permafrost types 
(see legend) are defined by the percentage of the year-round frozen ground. 


(55° 17’ 59.03” N-105° 19’ 46.02” E) (Fig. 1), located in an area of dis- 
continuous permafrost with a MAAT of about —2 °C (Extended Data 
Figs. 2,3). 

This study continues the research of Vaks et al."*, who found that, in 
Ledyanaya Lenskaya Cave, the most recent permafrost thaw occurred 
at 429 +23 thousand years ago (ka) (the original age cited by Vaks et al."* 
is 427 + 23 ka; the age above is re-calculated using updated half-lives 
of 7*U and *°Th (ref. '8)) during the warmth of Marine Isotope Stage 
(MIS) 11°. That study was limited, however, by the approximately 500 ka 
range of uranium-thorium (U-Th) chronology, meaning that older 
samples could not be analysed. U-Pb chronology enables dating of such 
older speleothems. Here, we use 52 U-Pb ages of 11speleothems from 
Ledyanaya Lenskaya Cave to extend the age range of known Siberian 
permafrost history. A smaller number of ages were also determined 
for three samples from the more southerly and warmer Botovskaya 
Cave" (Fig. 1) (also see Methods, Extended Data Figs. 1, 3-9, and Sup- 
plementary Tables 1 and 2 for chronological methods and full results). 
Ages indicate a division of speleothem deposition in Ledyanaya Len- 
skaya Cave, and therefore of permafrost presence, into three distinct 
periods (Figs. 2a, 3a). 

During the period from1,500 to about 1,350 ka, speleothems appar- 
ently grew continuously (within the limits of analytical uncertainties), 
suggesting discontinuous or absent permafrost above the cave. Glob- 
ally, this interval spans MIS 50 to MIS 43 andis characterized by glacial— 
interglacial cycles with 41,000-yr periodicity”. Most of the analysed 
speleothems in Ledyanaya Lenskaya Cave grew in this period (7 out of 
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11) (Extended Data Fig. 4). These oldest vadose speleothems were the 
first deposited directly on the cave host rock, heralding the onset of 
vadose conditions at this site, and suggesting that, before 1,500 ka, 
the cave may have been located below the local groundwater level. 
The current groundwater table is located about 50 m below the cave’s 
entrance and controlled by the nearby Lena River. 

The period from about 1,350 kato about 400 ka is defined by intermit- 
tent speleothem growth with long-lasting hiatuses without speleothem 
deposition. These growth cessations probably indicate continuous 
periods of permafrost and are found at the time of most glacial MISs 
and some interglacial MISs. Speleothem growth, demonstrating the 
absence of permafrost, occurred during most interglacials (Fig. 2a). 
From about 1,300 ka, speleothems only grew inthe shallower portion 
of the cave (15-20 m below the surface) and not in the deeper area 
(about 60 m) (Figs. 2a, 3a, Extended Data Fig. 1). This may indicate that 
the permafrost was thawing only to depth of 15-20 m, whereas relict 
permafrost remained at greater depth, showing that the duration of 
thawing periods was relatively short and/or that MAATs were reduced 
compared with the period before 1,300 ka. 

From about 400 ka until present, speleothem growth ceased com- 
pletely and permafrost appears to have been continuous above Ledya- 
naya Lenskaya Cave" (Figs. 2a, 3a). 

Caves located farther south near Lake Baikal (Botovskaya and Okhot- 
nichya) (Fig. 1) show speleothem deposition during warm periods dur- 
ing the past 700 ka (this study and Vaks et al.* (Fig. 2a)). These southerly 
caves indicate that climate in southern Siberia was warmer than in 
Ledyanaya Lenskaya Cave, enabling deposition of speleothems, while 
in Ledyanaya Lenskaya Cave to the north, speleothem growth ceased 
completely for the entire past ~400 ka. 

On the basis of data for the past 500 ka, Vaks et al. found permafrost 
thawing at Ledyanaya Lenskaya Cave during the unusual warmth of 
MIS 11 (429 + 23 ka) but not in younger interglacials. They suggested 
that an increase in global mean surface temperature of 1.5 °C (above 
pre-industrial levels) represents the threshold above which continu- 
ous permafrost thaws at its southern fringes. Our results indicate 
that substantial speleothem deposition occurred before MIS 11, when 
global mean surface temperatures (as indicated by Pacific warm pool 
sea surface temperatures (SSTs)) were lower than those of MIS 11 (for 
example, MISs 25, 19 and 15) and even lower than today (for example, 
MIS 23; Fig. 2c)”°. These earlier periods of speleothem growth indicate 
that global mean surface temperature is not the only control on the 
extent of Siberian permafrost. Other possible controls may include: 
(1) local summer insolation; (2) palaeogeographic changes; (3) greater 
poleward heat transport inthe Northern Hemisphere, leading to rela- 
tively warmer conditions inthe North Atlantic, Arctic, and/or over the 
Eurasian landmass; or (4) Arctic Ocean SSTs and the extent of Arctic 
summer sea-ice cover. 

The intensity of summer insolation at latitude 60° N (Fig. 2d)” may 
directly affect Siberian summer temperatures and therefore influence 
permafrost thawing. Many periods of speleothem deposition occurred 
whenJuly insolation was high (that is, >500 Wm”), but there is no direct 
relationship between insolation and thawing of permafrost. No thaw- 
ing took place during periods of insolation >500 W m” during the past 
400 ka, but thawing did occur at much lower insolation earlier in the 
record. Local summer insolation is therefore not the key factor deter- 
mining the presence of permafrost above Ledyanaya Lenskaya Cave. 

Arctic palaeogeographic conditions during interglacials of the 
entire past 1,500 ka were similar to present, with the open Bering Strait 
enabling water exchange between the Pacific and Arctic oceans”. The 
Atlantic meridional overturning circulation (AMOC) transports heat 
from the tropics to the northern Atlantic Ocean, thereby increasing 
the heat flux to high latitudes in continental Eurasia and the Arctic”. 
The period between about 2.4 and 1.3 million years ago (Ma) was char- 
acterized by enhanced AMOC, causing heat piracy from the Southern 
Hemisphere to the Northern Hemisphere, which was consequently 
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Fig. 2| Siberian speleothem deposition periods compared with records 

of MISs, Pacific warm pool SSTs and July mean insolation at 60° N. 

a, Distribution of speleothem U-Pb and U-Th ages (+20) in Ledyanaya Lenskaya 
Cave (purple circles) and in Botovskaya Cave (light blue circles) in time (ka) and 
space (latitude ° N). Ages of Ledyanaya Lenskaya SLL14 speleothems (60 m 
below the surface) are marked by dark purple circles, and of SLL9/SLL10 
speleothems (15-20 m below the surface) by light purple circles. Purple shaded 
areas show how periods of speleothem growth in Ledyanaya Lenskaya Cave 
relate to other climatic records. b, Benthic &°O stack” with glacial MIS 


relatively warm. This enhanced northward heat flux caused substan- 
tially higher SSTs in the North Atlantic than those of most of the Mid- 
dle-Late Quaternary and Holocene?’ (Fig. 3c, d). After about 1,300 ka, 
the AMOC gradually weakened, leading to a concurrent decrease of 
the North Atlantic SST of 1.5 °C to 3 °C (ref. 4; Fig. 3c, d). This is likely 
to cause progressive cooling of the Eurasian landmass ona long-term 
scale and could influence the presence of permafrost (which is much 
less common early inthe record when the Northern Hemisphere was 
warmer on average). Again, there is no simple relationship between 
speleothem growth in Ledyanaya Lenskaya Cave and North Atlantic 
warmth: periods of significant North Atlantic warmth during the 
past 400 ka are not associated with permafrost thawing. For exam- 
ple: MIS 9 (Fig. 3c, d) and MIS 5 (Fig. 3d) were warmer than all other 
times in the past about 1,300 ka, but there is no permafrost thawing 
above Ledyanaya Lenskaya Cave. Of these, MIS 5 is particularly nota- 
ble because both the Pacific warm pool (Fig. 2c) and North Atlantic 
(Fig. 3d) were warmer than many earlier thaw periods, and as warm 
as interglacials in the period 1,500-1,300 ka when the permafrost 


numbers below and interglacial MIS numbers above, c, Pacific warm pool 
magnesium/calcium-inferred SST changes during the past 1,400 ka, which are 
considered to bea reasonable reflection of changes in the mean Earth surface 
temperature”’. The pre-industrial SST is marked by the lower dotted red 
horizontal line and the abbreviation ‘PI’. The SST 1.5 °C higher than the pre- 
industrial level is marked by the upper dotted red line and the abbreviation 
‘+1.5 °C’, Ocean Drilling Program (ODP) site 806. d, July mean insolation at 
60°N”, 


above the Ledyanaya Lenskaya Cave was discontinuous or absent. 
Yet, there is no evidence for permafrost thawing above Ledyanaya 
Lenskaya Cave at MIS 5 (Fig. 3a). 

Itis striking that the permanent presence of permafrost since 400 ka 
initiates at the same time as perennial sea ice is established in the Arctic 
Ocean (Fig. 3a, e). The appearance of perennial sea ice is marked by 
an abrupt increase in the appearance of sea-ice-associated fauna in 
the western Arctic Ocean’ (Fig. 3e) and the disappearance from the 
Arctic Ocean of fauna that today is found in the North Atlantic”. The 
perennial Arctic sea ice (Fig. 3e) remained intact even during MIS 9 and 
MIS 5e, when both the tropics (Fig. 2c) and North Atlantic (Fig. 3c, d) 
were particularly warm. 

Climatic models for present and past climates indicate a rela- 
tionship between Arctic sea ice and Eurasian permafrost caused by 
changes in atmospheric heat transport®”°. Removal of Arctic sea ice 
warms the air above the sea surface’, increasing the moisture content 
of the atmosphere, and therefore increasing the transport of both 
sensible and latent heat from the ocean via atmospheric transport 
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Fig. 3 | Siberian speleothem deposition periods compared with records of 
MISs, North Atlanic SST and presence of seaice inthe Arctic Ocean. 

a, Distribution of speleothem U-Pb and U-Th ages (+20) intime and space 
(details in caption of Fig. 2).b, Benthic 880 stack” with glacial MIS numbers 
below and interglacial MIS numbers above. c, North Atlantic mid-latitude 

(41° N, 33° W) alkenone unsaturation ratio SST as recorded in ODP 607 core”, 
and with an 81-data-point (-200 ka) running average showing an approximately 


to continental interiors*. Resulting warming can penetrate up to 
1,500 km inland, peaking in autumn, leading to permafrost degrada- 
tion’. An open Arctic Ocean also leads directly to increased transport 
of moisture from the sea to the continent. High deuterium-excess 
values of autumn atmospheric precipitation in Siberia show that the 
open Arctic Ocean comprises a substantial moisture source that is 
shut down when the Arctic ice cover is established in early winter”. 
Models show that future increase in Arctic precipitation will come 
mainly from evaporation from the Arctic Ocean due to retreating sea 
ice, and not from enhanced moisture transport from lower latitudes”®. 
At present, decreased sea-ice cover inthe autumn increases moisture 
and leads to heavier autumn snowfall over Siberia®®. Thicker autumn 
snow cover insulates the ground from cold winter temperatures, 
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3 °C decrease in the long-term SST. d, North Atlantic high-latitude (58° N, 16° W) 
alkenone unsaturation ratio SST as recorded in ODP 982 core”, and with an 
81-data-point (-200 ka) running average showing an approximately 1.5 °C 
decrease inthe long-term SST. e, Per cent abundance of the genus Polycope, 
abenthic opportunistic genus signifying high local surface productivity in 
Arctic sea-ice margin environments, and therefore presence of the seaice in 
Northwind Ridge, western Arctic Ocean’. 


increasing winter and mean annual ground temperatures”. This 
effect, well known to influence seasonal vegetation”, has not pre- 
viously been recognized as a substantial long-term control on the 
extent of permafrost, but the strong relationship between perennial 
sea ice and permafrost observed in this study suggests that it may 
be animportant controlling factor. The appearance of perennial sea 
ice about 400 ka decreases the Arctic heat (sensible and latent) and 
moisture source”, cooling the Arctic air and reducing snowfall on the 
continent. This may lead to poorer ground insulation and a decrease 
in ground temperature, assisting the establishment of continuous 
permafrost”. Future loss of summer sea ice may have the opposite 
effect—increasing ground temperatures and, subsequently, increas- 
ing speed of permafrost thawing. 


The stability of continuous permafrost near its modern southern 
boundary in Siberia hinges on perennial sea-ice cover in the Arctic 
Ocean. Although the speleothems’ record of this study also indi- 
cates intervals of permafrost before the formation of perennial sea 
ice at about 400 ka, such permafrost was prone to thawing in times 
of higher Northern Atlantic and/or global mean temperature. The 
long-term cooling of the Arctic Ocean that occurred between around 
1,350 kaand around 400 ka eventually reached a temperature thresh- 
old for the formation of perennial Arctic sea ice, which stabilized 
the presence of continuous permafrost in Siberian regions where it 
remains today. This record indicates that, under future open-water 
Arctic scenarios, as predicted for later this century”, this stabiliza- 
tion is likely to be removed, accelerating the northerly retreat of 
continuous permafrost. 
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Methods 


Description of the caves 
Ledyanaya Lenskaya Cave. The cave is located 116 km east-southeast 
of the town of Lensk, 180 m above sea level, with the cave entrance 
located on the northeastern riverbank in a cliff about 50 m above the 
Lena River. The local vegetation is subboreal taiga forest®. The cave is 
developed in Ediacaran limestones and marls and its lengthis mapped 
to about 21 m. An approximately 90-m-long main passage ascends by 
about 15° in a north-northeast direction, ending ina central roughly 
55-m-long and 10-20-m-wide hall, with a ceiling height up to 8 m. The 
hall is mostly filled with massive ice several metres thick. A narrow pas- 
sage leads from its top to the cave’s upper section. The latter is about 
70 mlong, ascending by about 20°, and consists of two chambers con- 
nected by a narrow passage and partly filled by ice. The walls of these 
chambers are partly covered by flowstone and stalactites (SLL14, taken 
in 2014; Extended Data Fig. 1A). Around 50 m fromthe main entrance, a 
narrowslightly ascending roughly 40-m-long corridor splits from the 
main passage in aeast-northeast direction, and ends in small chambers 
in which vadose speleothems were collected (SLL9 and SLL10, takenin 
2009-2010; Extended Data Fig. 1A). The depth of this chamber below 
the surface is 15-20 m, and the depth of the large hall is about 60 m. 
The cave air temperature was monitored using HOBO UA Pendant 
Temperature Loggers from March 2010 to May 2013 (logger ‘Siberia 
7’) and November 2013 (logger ‘Siberia 6’) (Extended Data Fig. 2A, B). 
The loggers’ measurement uncertainty is +0.47 °C. The temperature 
in the central hall with massive ice was measured with logger ‘Siberia 
6’ (Extended Data Fig. 1A) and was found to vary between —0.1 °C and 
0.0 °C in early spring, and between +0.6 °C and +0.7 °C in summer 
(Extended Data Fig. 2A). Logger ‘Siberia 7’ measured the temperature 
in the chamber with speleothems SLL9 and SLL10, and the tempera- 
ture there is relatively stable at about +0.3 °C (Extended Data Fig. 2A). 
Although in both places temperatures are slightly above zero, no water 
seepage or speleothem growth was found beyond several metres from 
the entrance, showing that the rock above the cave is frozen year-round. 
Inthe uppermost cave chambers where SLL14 speleothem samples were 
collected, the temperature inJanuary 2014 was 0 °C and the ice was dry. 
According to Lensk meteorological station (Extended Data Fig. 2B) the 
MAAT between 2010 and 2013 was about —6 °C, thus the cave is 6 °C 
warmer. The cave’s ascending morphology, with the entrance being 
its topographically lowest point, causes warm humid air to be trapped 
inside during the summer months. Formation of ice inthe uppermost 
parts of the cave shows that the cooling that creates the ice occurs when 
warm and moist air comes in contact with the subzero temperature of 
the cave walls*. This is also supported by the permafrost map, which 
shows continuous permafrost (type 18) above the Ledyanaya Lenskaya 
Cave*? (Extended Data Fig. 3A). More information about the cave can 
be found in the Supplementary Online Materials of Vaks et al."*. 


Botovskaya Cave. The cave system is located 58 km north-northeast 
of the town of Zhigalovo, 750 m above sea level, at the head of a small 
valley northeast of the Boti River, which joins the Lena River 8.6 km 
southeast of the cave. The vegetation is subboreal taiga forest”. The 
cave is located in Ordovician limestones and sandstones, and is the 
longest cave system in Russia, reaching a total length of >69 km anda 
depth of up to 130 m below the surface, comprising a horizontal maze 
of thousands of passages developed along the criss-cross system of 
tectonic fissures (Extended Data Fig. 1B). 

Cave air temperatures were monitored using HOBO UA Pendant 
Temp loggers, from February 2010 to February 2016. The tempera- 
tures in the deeper parts of the cave are stable at +1.9 + 0.3 °C but vary 
from —0.2 °C to +1.6 °C near the entrance, with minima in winter and 
maxima in summer (Extended Data Fig. 2C). Surface air temperatures 
(SAT) outside the cave were also monitored by a HOBO logger tied on 
the northern shady side of a big tree 2 m from the ground. The mean 


annual SAT is -2.1°C, varying between summer and winter extremes 
from +37 °C to<-40 °C (-40 °C is the minimum limit of the logger; dur- 
ing two nights in winter 2010-2011 the minimum temperature dropped 
below -40 °C). The MAAT is thus lower than the cave air temperature 
by 3-4 °C (Extended Data Fig. 2D). 

Regional permafrost is discontinuous and found below the Boti River 
Valley and its slopes, but is absent from the plateau above the inner 
parts of Botovskaya Cave® (Extended Data Fig. 3B). The eastern part 
of the cave (‘New World’) is where the most water seepage and modern 
speleothem deposition occurs today (Extended Data Fig. 3B). This 
cave section is located below a small surface depression that hosts an 
intermittent stream that allows rain/snowmelt water to seep into the 
cave. Thermal energy from infiltrating water probably contributes 
to an absence of permafrost in this area. Speleothem samples were 
collected in the ‘New World’ section of the cave. The western part of 
the cave (‘Old World’) is drier, with some passages near the entrance 
clogged with massive ice. 


Speleothem petrography 

In Ledyanaya Lenskaya Cave, the speleothem cover on the cave walls 
and floor is usually 5-10 cm thick. Speleothems consist of several calcite 
horizons, each composed of brown or grey columnar calcite crystals 
usually free from detrital material (Extended Data Figs. 4, 6A, B). These 
calcite horizons are separated by whitish or beige thin (<2 mm) layers 
of microcrystalline calcite, sometimes containing pieces of marl and 
limestone host rock. These layers represent growth hiatuses, some- 
times with pieces of broken host rock from the cave ceiling remaining 
onthe ancient speleothem surface. 

In Botovskaya Cave, speleothem deposition is much more wide- 
spread and massive than in Ledyanaya Lenskaya Cave. Here, the thick- 
ness of the speleothems is many tens of centimetres, showing that 
compared with Ledyanaya Lenskaya Cave, the humid and warmer 
climate of the area provided speleothems with better opportunities 
to grow (Extended Data Fig. 5). Active speleothems are also found in 
this cave. Unlike in Ledyanaya Lenskaya Cave, most speleothems in 
Botovskaya Cave are composed of aragonite (Extended Data Fig. 6C, D), 
but with some calcite speleothems (for example, stalagmite SB-6919) 
and some speleothems comprising alternate aragonite and calcite 
layers (for example, stalagmite SB-01112). Apparent growth breaks, 
sometimes separating calcite and aragonite layers, are common. 


Methods used in this study 

The speleothems were sectioned using a diamond saw. For the purpose 
of U-Th dating, between 10 and 250 mg of powder was drilled from 
each sampled horizon using 0.8-1.0 mm drill bits. Speleothem min- 
eralogy was examined at ETH Zurich, Switzerland, using a Bruker, AXS 
D8 Advance powder XRD diffractometer, equipped with a scintillation 
counter and an automatic sampler. Macro and microscope inspection 
shows that all horizons chosen for dating had atypical columnar petrog- 
raphy in calcite and fibrous petrography in aragonite (Extended Data 
Fig. 6), with almost no voids or re-crystallization marks, suggesting 
that they probably maintained closed-system conditions for U-series 
chronology (except for some growth hiatuses, such as that in SB-01112; 
Extended Data Fig. 5). 


U-Pb chronology 

Analytical methodology. Ages were determined by isotope dilution 
using a mixed *°U-*Pb—Th spike* and a first-generation Nu Plas- 
ma multi-collector inductively coupled plasma mass spectrometer. 
Subsamples were cut using a small diamond saw and transferred to 
acid-cleaned (1-2 M HNO, for >3 d) 15 ml polypropylene bottles. The 
subsamples were then sonicated repeatedly in18 MO. cm water until no 
suspended fines were visible, rinsing between each wash. Subsamples 
were then acid cleaned twice in distilled 2% HNO, with sonication to re- 
move any residual dirt. Following each wash, samples were thoroughly 


washed with 18 MO cm water and sonicated to remove any residual acid 
and dislodged surface material. Each acid wash was removed before 
the acid reaction completed, to prevent adsorption of dissolved ions 
back onto the surface of the sample. 

One totwo drops (about 30 ll per drop) of spike were added directly 
to the acid-cleaned carbonate and gently agitated to mix as the spike 
fully dissolved the sample. Once the visible reaction was complete, the 
solution was diluted to about 15 ml with 18 MO cm water, thoroughly 
shaken to homogenize and then immediately analysed, with no pre- 
concentration of U and Pb. 

Analyses followed a six-step routine. In steps 0, 1,2 and 3,7°8Pb,?°’Pb, 
206Db, 2°4Pb + 2°4Hg and 7”Hg were measured using three ion counters 
at 2 AMU spacing. The relative gains of the ion counters were deter- 
mined by stepping Pb + *°*Hg alternately into each collector. ?”Hg 
was monitored to correct for *“Hg on 7™Pb. In steps 4 and 5, °U was 
measured ona Faraday cup, with ?*U and **°U measured alternately 
on both a Faraday cup and anion counter; this allows the use of the 
Faraday/Faraday 7°U/?*U ratio, or the Faraday/ion counter 7°U/?*U 
ratio depending on the **U signal intensity. Faraday/Faraday ratios 
were normally used for both %U/*U and 78U/?*U. 

Mass fractionation was estimated using the measured 7*°U/7*U ratio 
of the samples and an assumed natural value of 137.75 (ref. *). On the 
basis of previous testing of the instrument, the mass fractionation for 
Pb was assumed to be 2% per AMU higher than for U (ref. *°). 

Before the first analysis, the Nu Instruments DSN100 sample intro- 
duction system and sample lines were cleaned with 10% HNO;, 2% HNO; 
and 18 MOcmwater. A dedicated set of B-type cones reserved for very- 
low-level Pb work were used. These were gently cleaned by rinsing with 
deionized water before use to remove excessive Ca build-up from the 
skimmer orifice. As far as possible, the surface coating on the cones was 
not disturbed. The instrument was then initially tuned and optimized 
with a 100 ppt thallium (TI) solution and diluted natural U solution. 
Intentional addition of Pb was avoided during tuning to prevent re- 
contamination of the instrument, but sufficient Pb blank is present in 
the TI solution to see the Pb peaks on theion counters. Peak shape and 
optimization was then re-checked on samples; focusing settings for the 
zoom optics often changed substantially from the nominally clean Tl 
solution to the matrix-heavy samples, especially following cleaning of 
the cones. Gas flows were also re-optimized to suppress interferences 
(probably from Sr,0,), which manifest as superimposed peaks about 
0.2 AMU lighter than the Pb peaks, especially on 7°°Pb. Optimization 
was checked again after an initial couple of sacrificial analyses and 
regularly during the analytical session. The DSN100 was re-cleaned with 
18 MO. cm water every 1-2 d to remove Ca build-up and U and Pb blank. 

Separate sample aliquots up to about 0.2 g were dissolved and puri- 
fied to obtain U cuts for measurement of the “*U/**U ratio. Purifica- 
tion used 2 ml columns with AGIX8 anion exchange resin. Samples 
were loaded in, and Ca eluted with about 10 M HCI. U was eluted with 
18 MO cm water. The purified U was measured onthe same instrument, 
with 74U and 7°U measured on ion counter and Faraday collectors, 
respectively. Standard bracketing with CRM145 (CRM112A) was used 
to correct both for mass fractionation and ion-counter gain. 


Non-radiogenic Pb correction. Model ages were calculated from 
each pair of U-Pb and *U/5U analyses. The “°U—*”°Pb decay provides 
the age data used here, but the 7U-?°’Pb system was also measured 
and provides an assessment of concordance, and thus confidence in 
obtained ages. 

Ages were calculated using an estimated ?°°Pb/*°°Pb (and 2°°Pb/*"’Pb) 
ratio for the initial non-radiogenic Pb and, the modern-day measured 
disequilibrium inthe?*U/“U ratio to constrain the initial ?*U/7U ratio. 
28Pb is assumed to be entirely non-radiogenicon the basis that This typi- 
cally at very lowconcentrationin speleothems and that samples are young 
compared with the Th half-life. Common 7°SPb/*Pb (and 7°8Pb/*°’Pb) 
for the non-radiogenic Pb correction was estimated by acombination of: 


(1) Identifying and analysing unradiogenic parts of the sample, not 
greatly modified by Pb from decay. 

(2) Retrospectively picking approximate isochrons from the data, on 
the basis that given a large number of analyses the following are likely: 
(a) age overlap between samples/subsamples can be expected and 
hence some clumping of analyses along mixing lines between the initial 
Pb composition and the radiogenic composition for a given (approxi- 
mate) age; (b) data should fan around the initial Pb composition; 
(c) subsamples of roughly the same age, can, toa first approximation be 
grouped based on the observed***U/*U ratio. The latter is somewhat 
limited by initial ?“U/**8U variation, but surviving *U disequilibrium 
decreases by a factor of 2 for each *U half-life, so for a spread of ages 
over afew hundred thousand years or more, the variations due to decay 
of *Uxs will dominate over variations in the initial ratio. 

(3) Linear regressions*®** through groups of data of similar age in 
234U (or 38U)/7°°Pb — 2°SPb/?’Pb isotope space allow an estimate of the 
initial 2°SPb/?°°Pb ratio ?>U/?"Pb -7°8Pb/?"’Pb isotope space for the ini- 
tial?°SPb/*°’Pb ratio). Only groups containing relatively non-radiogenic 
analyses (ideally stratigraphically bracketed by more radiogenic analy- 
ses) were used, to minimize the effect of incorrectly grouping samples 
of different age. The groupings used and regression results are shown 
in Supplementary Table 1 and illustrated in Extended Data Fig. 7. 

Acommon2*Pb/2°°Pb ratio of 1.471+ 0.100 (and 2°SPb/°’Pb ratio of 
2.465 + 0.136) (95% confidence) was used for the Ledyanaya Lenskaya 
Cave samples (Extended Data Fig. 7). The former value is based mainly 
onasingle grouping of samples that include the least-radiogenic Ledya- 
naya Lenskaya analysis, but is in agreement with a second generally 
more-radiogenic grouping. All other Ledyanaya Lenskaya data (except 
data rejected in Supplementary Table 1) are consistent with this com- 
mon ”°$Pb/*’Pb ratio (Extended Data Fig. 7). 

For Botovskaya Cave samples, which include highly non-radiogenic 
material, acommon 2°Pb/2Pb ratio of 1.997 + 0.213 (and 7°°Pb/?°’Pb 
ratio of 2.419 + 0.123) (95% confidence) was determined ina similar way. 
As the Botovskaya data also includes some analyses that are almost 
entirely non-radiogenic, these have also been taken into account. Sam- 
ple groupings and regression results are again shown in Extended Data 
Fig. 7 and Supplementary Table 1. 

The rather different values of the common Pb composition between 
the two caves are probably attributable to host-rock composition. 
Botovskaya and Ledyanaya Lenskaya caves are hosted in rocks of Ordo- 
vician and Late Proterozoic age, respectively, which have had long 
periods to evolve distinctive Pb compositions before the formation 
of the speleothems they now host. 

The use of model ages involves some degree of assumption about the 
uniformity of the common Pb composition. Consequently, an indica- 
tion of the sensitivity of a particular age tothe common Pb correction 
is givenin Supplementary Table 1 and Extended Data Fig. 7, and found 
to be small compared with quoted uncertainties. 

Pb blanks have not been separately corrected for and are dealt with as 
part of the total correction for non-radiogenic Pb. Given that anumber 
of analyses yielded >99% radiogenic”°Pb, the Pb blank can be consid- 
ered a generally minor source of non-radiogenic Pb. Sample 7“Pb was 
corrected in the isotope dilution calculation using 7°°Pb as a proxy, 
assuming a natural ?°°Pb/*™Pb ratio of 37.1 + 10 (95% confidence). For 
most analyses, >99% of the total *“Pb originated from the tracer, so 
the correction is small. 

Ages were calculated from the common-Pb-corrected 7°U/*°°Pb (and 
235 /?°7Pb) ratio and measured ***U/*°U ratio using in-house software. 
Uncertainties were estimated using a Monte Carlo approach. Initial 
230Th and ~'Pa are assumed to have been absent in the age/concordia 
calculations. Initial “*U is determined based on the present-day 4U/?2U 
ratio as part of the age calculation, much as in the U-Th chronology. 

The decay constants used are: °U, 1.55125 x 107°; **U, 2.82203 x 10°; 
230Th, 9.17055 x 10°; Ra, 4.33488 x 10*; ?*U, 9.8485 x 107; 7'Pa, 
2.11583 x 105; ??Th, 4.9475 x 10™ (refs. 183738), 
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Asacross-check on this methodology, two layers of stalagmite SLL10- 
6, Gand F, were also analysed following the method of Mason et al.™, 
which involved chemical purification of Pb from subsamples to gener- 
ate isochron ages. Within uncertainty, these agree with the analyses 
obtained with the simplified protocol outlined above. U-Pb ages were 
also cross-checked against U-Th ages determined for the parts of the 
samples <0.5 Ma old (Supplementary Tables 1, 2, Extended Data Fig. 8). 


Screening of data and data quality 

Ledyanaya Lenskaya Cave. Of the 59 U-Pb analyses (Supplementary 
Table 1), °U-**U-*’Pb ages were included from 52 analyses (Extended 
Data Fig. 8A). Of the excluded analyses, one has no corresponding 
234 /38LU) measurement (SLL10-4-B bottom), so an age cannot be calcu- 
lated. Three analyses (two from SLL10-6-B and one from SLL10-6-B/C 
hiatus) are non-radiogenic and yield ages with unhelpfully large uncer- 
tainties, though usefully help to constrain the common Pbcorrection. 
Two analyses (SLL14-1-C centre and SLL9-1-A2) yield >1.5 Ma apparent 
ages, which are not reproduced in other analyses and appear inconsist- 
ent with the “*U/5U measurements; and one analysis (SLL10-4-A top) 
is both out of stratigraphic order and in disagreement with two other 
analyses from the same layer. 

The remaining 52 ages are consistently in stratigraphic order. The 
about 1.3-1.5 Ma age cluster is useful for demonstrating analytical 
robustness, since the common Pb correction in these analyses is 
mostly small and layers can be dated in stratigraphic order with 
about 40 ka age resolution, even without pre-concentration of Pb. 
The replication of ages of many growth periods between different 
samples, even where the magnitude of the common Pb correction 
varies substantially, helps to validate the common Pb value used to 
calculate the model ages and corroborates the analytical robustness 
of the method. The agreement of the ages for SLL10-6-Ftop and SLL10- 
6-G with and without pre-concentration of U and Pb should also be 
noted (Extended Data Fig. 8A). 

235| J-?07b ages are also given for reference in Supplementary Table1 
and are mostly concordant with the *8U-**U-*Pb ages within error 
(Extended Data Fig. 9) (or, in five cases, show only slight discordance). 
These small discordances can probably be attributed to uncertainties 
inthe common Pb correction, and could be accounted for by shifts in 
the common Pb composition of a few percent. 

Samples SLL10-4C, SLL10-5B, SLL10-9A, SLL10-9C and SLL14-1B bot- 
tom have notably low 2>U-?’Pb ages relative to their °U-*4U—*°°Pb 
ages. None of these 28U-*4U-*Pb ages fails on the grounds of being 
inconsistent with stratigraphic position and they tend to replicate well 
in other samples (Extended Data Fig. 8). It is likely that this discord- 
ance is an analytical artefact, specifically incomplete elimination of 
the molecular interference on*°SPb. An extraneous contribution on 
208Pb will lead to overcorrection of the common Pb. Since””Pb is typi- 
cally some 20-30 times more sensitive (Supplementary Table 1) to the 
common Pb correction, a small overcorrection on the *°°Pb can cor- 
respond to a significant overcorrection on ”’Pb, hence the **U-*”’ Pb 
ages appear anomalously young while the °U-***U-°Pb ages remain 
stratigraphically consistent. 

The (near-)concordance of the majority of the ages provides confi- 
dence in the data from an analytical standpoint. 


Botovskaya Cave <0.7 Ma samples. Of the 13 analyses from Botovskaya 
Cave, 12 ?8U-*4U-*°6Pb ages were calculated. The analysis that failed 
to produce an age was from non-radiogenic calcite G layer in SB-01112, 
which is bracketed between more radiogenic aragonite layers, and so 
it still useful for constraining the common Pb composition. Ages fall 
between 0.4 and 0.7 Ma (Supplementary Table 1), and are in strati- 
graphic order overlapping well with available U-Th ages (Extended Data 
Fig. 8B).?°U-*’Pb ages could only be calculated for four analyses and 
are of low precision, but they are all concordant with the *8U*U-*"°Pb 
ages (Supplementary Table 1). 


Botovskaya Cave pre-0.7 Ma samples. Botovskaya Cave shows an 
extensive 2 Ma and older record, which is the subject of ongoing inves- 
tigation. The data of some of these samples are used to better constrain 
the common Pb correction inthe present work (Supplementary Table 1, 
Extended Data Fig. 7), although the ages are not used here because 
no pre-0.7 Ma Botovskaya samples overlapping the older part of the 
Ledyanaya Lenskaya record have yet been identified. 


U-Th chronology. The U and Thanalysis, as well as U-Th ages calcula- 
tions were performed following the Supplementary Online Methods of 
Vaks et al.’*. Twelve layers of five samples from Botovskaya Cave were 
dated. Dating results and their correction for initial Th were calculated 
using Isoplot 4.15” and are presented in Supplementary Table 2. ?°U 
concentrations in calcite vary between 0.44 and 1.25 ppm, whereas in 
aragonite they vary between 40.4 and 136.7 ppm.*”Th concentrations 
usually vary between 0.06 and 3.15 ppb. ?°°Th/*Th activity ratios inthe 
analysed samples varied between 6,322 and >10°, and the?” Th/?°U ac- 
tivity ratios varied between 4.78 x 10“ and 8.94 x10”. Thus, correction 
for Thinitia Was Negligible. Two ages from layer SB-01112-D were rejected 
because this layer is adjacent to a horizon with high porosity, represent- 
ing ajoint between a growth hiatus and acrack in the speleothem. The 
lower part of layer ‘SB-01112-D bottom’ shows an age reversal, possibly 
due to U leaching. Other U-Th ages shown in Extended Data Figs. 4, 5,8 
are taken from Vaks et al.", slightly modified using updated half-lives 
of *U and?°Th (ref. ). 


The ability of Arctic air to reach Ledyanaya Lenskaya Cave site 
Arctic air masses bring polar air southward to substantially influ- 
ence Siberian weather. In particular, these air masses are capable of 
bringing latent heat and moisture from the unfrozen Arctic Ocean 
into the Siberian continental interior during the period of October- 
early November. During this time of the year, the Arctic Oceanis still 
partly unfrozen and snow cover is forming over the Siberian land- 
mass. To demonstrate that these incursions of Arctic air are capable 
of reaching the site of Ledyanaya Lenskaya Cave, we show 12 examples 
of such snow events above Ledyanaya Lenskaya Cave in Extended 
Data Fig. 10. Three trajectories for air at elevations of 500, 1,500 and 
5,000 m were calculated for each event with at least one trajectory 
in each event starting in the Arctic Ocean. These trajectories were 
assessed using the NOAA Hysplit program (https://ready.arl.noaa. 
gov/HYSPLIT.php)*°*.. The proper timing of the weather events men- 
tioned above was received from log data of the Lensk meteorological 
station (number 24923), 139 km northwest from Ledyanaya Lenskaya 
Cave (https://rp5.ru/Ioroga_B Jlencxe, PecnyOmuka_Caxa (Akyrus) 
(in Russian)*), 
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Extended Data Fig. 1| Caves’ maps. A, Map and cross-section of Ledyanaya Lenskaya Cave, with speleothem and temperature logger positions. B, Map of 
Botovskaya Cave with locations of speleothems and temperature loggers. 
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Between 2010 and 2014, temperatures were monitored deep inside the cave, 

whereas between 2015 and 2016 loggers were placed closer to the entrance. 

D, Comparison of Botovskaya Cave temperatures with mean surface 

temperature and surface temperature changes between February 2010 and 

July 2014 (data from temperature logger outside the cave). 


Extended Data Fig. 2| Temperature data inside and outside caves. 

A, Ledyanaya Lenskaya Cave air temperatures between March 2010 and 
December 2013. B, Comparison of air temperature in Ledyanaya Lenskaya Cave 
(from A), mean annual surface temperature and surface air temperature 
changes (Lensk meteorological station 24923 data). C, Monitoring of air 
temperatures inside Botovskaya Cave (February 2010-February 2016). 
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Extended Data Fig. 3| Detailed permafrost maps. A, Permafrost map of the by black circle. Types and thickness of the permafrost are shown below. In 
Ledyanaya Lenskaya Cave area (reproduced with permission from ref. ®, places with continuous permafrost (types 18-20), it covers >95% of the area, 
Cambridge Univ. Press). The cave site is marked by a magenta circle. Lensk and but may contain small unfrozen units (taliks), mainly under permanent bodies 
Olekminsk are marked by black circles. B, Permafrost map of the Botovskaya of water. Taliks may go through the entire permafrost layer (through taliks), or 


Cave area®’. The cave location is marked by magenta circle. Zhigalovoismarked —_ through part of it (not through taliks). 


1) Stalagmite SLL10-6 4) Stalactite SLL9-2 


7) F 
---786 + 18 16 


lowstone SLL14-3 


8) Flowstone SLL10-4 


Ep 


782 + 10’ 


58047 


f 


ey 9) Flowstone 
2) Flowstone SLL10-5 5) Stalactite SLL9-1 SLL14-1 


860 + 10 


853 + 21 


3) Flowstone SLL14-5 6) Stalactite SLL10-9 
1439 + 11 1343 +14 


11) Stalactite SLL9-4 
1391 + 26 : 


1076414 


t 
958 +9 1374421 


i ayy 


Extended Data Fig. 4| Pictures of speleothems’ cross-sections with ages (in ka) from Ledyanaya Lenskaya Cave. Age uncertainties are 20. The U-Pb ages shown 
in black anda single U-Th age in stalactite SLL9-2" is shown in brown. Hiatuses are shown with red arrows. All speleothems are composed of calcite. 
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Extended Data Fig. 5| Pictures of speleothems’ cross-sections with ages of aragonite, except stalagmite SB-6919, and layers A-D and Ginstalagmite 
(in ka) from Botovskaya Cave. Age uncertainties are 20. The U-Pb ages shown SB-01112, which are calcitic. 
in black and U-Thages" are shown in brown. Most speleothems are composed 


Extended Data Fig. 6 | Speleothems’ petrography. A, Calcitic stalagmite stalagmite SB-7497(3) with fibrous crystals. D, Magnified area (3 to 1.5 cm) 
SLL10-6 with its layered structure. B, Magnified area (5 to3 mm-—rectangle on shown by black rectangle in C in plain polar light. 
the top of A) incrossed polar light, showing columnar crystals. C, Aragonitic 
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Extended Data Fig. 7 | Common Pb composition assessment for Ledyanaya 
Lenskaya (upper) and Botovskaya (lower) caves. 7*U-*’Pb and 7U-*°°Pb 
appear onthe left, and?*U—*’Pb on the right, showing consistent common Pb 
values for eachcave. The yintercept represents the common lead ratio; yellow 
bars show the assigned range for common Pb in age calculations. Groups refer 
to the regression groups in Supplementary Table 1 with solid black lines 
showing regression fits. Grey contours show the percentage changein 
calculated age resulting from changing the common Pb composition by its 
assigned uncertainty. Dashed black lines show reference isochrons and 
ungrouped data are shown by grey circles. Uncertainties (20) are sometimes 
smaller than the symbol size. Particular details for each plot areas follows. 

A, Estimate of the common”°*Pb/*°°Pb for Ledyanaya Lenskaya Cave in 

234) /2°6Ph-?8ph /2°Ph isotope space. **“U is used in the plot instead of **U to 
suppress scatter in the U/?°°Pb ratio due to variations in the 24U/7°U; ,i.q Fatio. 
All but two of the ungrouped data have age uncertainties due to common Pb of 
<1% and are thus insensitive to the common Pb correction. Most data that are 
more sensitive to the common Pb composition (that is, those between the 1% 
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and 3% contours) are included in the regressions to estimate the common Pb 
composition. B, Equivalent plot in?>U/?" Pb-*°8Pb/?”’ Pb isotope space for the 
estimation of the common?*Pb/?’Pb ratio for Ledyanaya Lenskaya cave. 
Groups 3 and 4 correspond to the clump of ungrouped data close tothe 
horizontal axis at about 0.225 onA. Note that ?*U/?"’Pb ages are substantially 
more sensitive to the common Pb correction; these ages are used as acheck on 
U-Pb concordance rather than to derive the dates used in the paper. C, Estimate 
of the common”°Pb/?°Pb for Botovskaya Cave in 7°U/?°°Pb-*°8Pb/?°Pb 
isotope space. Groups 1, 3, 4 and 6 (plotted as squares) are unpublished data 
from about 2 Masamples included here only to provide additional constraint 
onthecommon Pb composition (Supplementary Table 1). Ages used here are 
from the data plotted as circles. As for Ledyanaya Lenskaya Cave, the common 
Pb estimate includes the data for which the correction matters most as far as is 
possible. D, Equivalent plot in?*>U/?"’Pb-*°>Pb/?°’ Pb isotope space for the 
estimation of the common?*Pb/?’Pb ratio for Botovskaya Cave. ?°U/?’Pb data 
are more sensitive to the common Pbcorrection. 
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Extended Data Fig. 8 | Detailed speleothems’ chronologies. A, The 
chronology of Ledyanaya Lenskaya Cave speleothems with 95% confidence 
errors. Each columnin the plot represents one individual speleothem named in 
bottom horizontal axis. In each column, the U-Pb ages (purple circles) appear 
in stratigraphic order from left (young) to right (old). For each U-Pb age, the 
corresponding proportion of radiogenic?°*Pb (right vertical axis) is shown by 
red circles above. The U-Thage of the youngest layer A in the stalactite SLL9-2" 
is shown by blue circle (bottom left). The two isochron ages of layers F top andG 


in SLL10-6 stalagmite are shown by olive circles. Several replicate age 
determinations for similar layers were performed and appear in the plot inthe 
same order as shown in Supplementary Table1.B, Botovskaya Cave 
speleothems’ U-Th ages (dark blue circles, left y axis; Vaks et al.“ and current 
work) and U-Pb ages (light blue circles, lefty axis), the latter are given with 
percentages of radiogenic?Pb (red circles, right y axis). Allage uncertainties 
are shown by 95% confidence error bars. The stratigraphic age of the layers 
dated from stalagmite SB-01112 increases from left to right. 
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Extended Data Fig. 10 | HYSPLIT-model-based seven-day back-trajectories 169478; Job Start: Sun 8 Sep 13:58:40, UTC 2019; Meteorology: OOOOZ, 22 Oct 
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occurred in October and first half of November of 2013 (G-L). For each of 12 2013 - GDAS1 (G);Job ID: 172040; Job Start: Sun 8 Sep 16:46:37 UTC 2019; 
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sea level were calculated, with at least one of them originating inthe Arctic Sep 16:54:44 UTC 2019; Meteorology: OOOOZ, 15 Oct 2013 - GDAS1(I);Job ID: 
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Predator-prey cycles rank among the most fundamental concepts in ecology, are 
predicted by the simplest ecological models and enable, theoretically, the indefinite 


persistence of predator and prey’ *. However, it remains an open question for how 
long cyclic dynamics can be self-sustained in real communities. Field observations 
have been restricted to a few cycle periods’ * and experimental studies indicate that 
oscillations may be short-lived without external stabilizing factors” ”. Here we 
performed microcosm experiments with a planktonic predator-prey system and 
repeatedly observed oscillatory time series of unprecedented length that persisted 
for up to around 50 cycles or approximately 300 predator generations. The dominant 
type of dynamics was characterized by regular, coherent oscillations with a nearly 
constant predator-prey phase difference. Despite constant experimental conditions, 
we also observed shorter episodes of irregular, non-coherent oscillations without any 
significant phase relationship. However, the predator-prey system showed a strong 
tendency to return to the dominant dynamical regime with a defined phase 
relationship. A mathematical model suggests that stochasticity is probably 
responsible for the reversible shift from coherent to non-coherent oscillations, a 
notion that was supported by experiments with external forcing by pulsed nutrient 
supply. Our findings empirically demonstrate the potential for infinite persistence of 
predator and prey populations in acyclic dynamic regime that shows resilience in the 
presence of stochastic events. 


Cyclic dynamics are one of the most notable phenomenain population 
biology and are known to occur ina large range of communities bothin 
the wild> ®3" and the laboratory? ”>"®, A number of mechanisms that 
cause populations to oscillate have previously been identified?*”°; the 
most highly investigated, however, are the cyclic dynamics that arise from 
trophic interactions between populations of predator and prey organ- 
isms. The nearly century-old fascination with this type of dynamics is 
rooted inthe predictions of simple mathematical models, which suggest 
that the predator and prey may coexist on recurring cyclic trajectories 
over indefinitely long periods of time’. Empirical data that support 
the sustained nature of predator-prey cycles in the field are difficult to 
come by because tractable populations typically have cycle periods of 
three to ten years*>”, An alternative data source are laboratory studies 
with fast-reproducing organisms, with which long-running experiments 
canbe realized under strictly controlled conditions””°”"*”, Ina seminal 
experimental study, population numbers of weevils and their larval 
parasites were recorded for a time span of about 20 cycles but oscilla- 
tions could not be sustained for the whole duration of the experiment”. 
This observation is paradigmatic for many studies, which showed that 
cyclesin closed laboratory predator-prey systems tend to be short-lived, 
with populations either becoming extinct or reaching a stationary state. 

These findings suggest that sustained, intrinsically generated preda- 
tor-prey cycles might be a construct of ecological theory and that 


predator-prey systems require external drivers such as spatial struc- 
ture, immigration or environmental perturbations to exhibit persistent 
cycles*")2.1722.23 To explore the potential for long-term persistence of 
predator-prey cycles, we cultured freshwater organisms—planktonic 
rotifers together with their prey, unicellular green algae—in several 
independent chemostat trials under constant environmental condi- 
tions (Methods, experiments C1-C7). One of our experimental systems 
(Fig. la and Extended Data Fig. 1; Clin Extended Data Table 1) persisted 
without any external stimuliin a homogeneous environment for over 
ayear and displayed more than 50 cycles, which is equivalent to about 
300 predator generations and at least as many prey generations. This 
result of the long-term persistence of the predator and prey inacyclic 
dynamic regime was confirmed in four additional, replicated chemostat 
experiments with the same species (C2-CS5 in Extended Data Table 1, 
Extended Data Fig. 2), and in two additional experiments (C6 and C7) 
with a different algal prey species (Extended Data Fig. 3). We applied 
power analysis and phase analysis (in particular, bivariate wavelet 
analysis”"*°) (Methods) to quantify the cyclic succession and the 
distribution of phase differences in our measured time series. Wavelet 
analysis revealed statistically significant associations between the 
dynamics of predator and prey densities in our experiments and power 
analysis determined a mean period length of 6.7 days (C1-CS5) (Fig. 1d 
and Extended Data Fig. 1d, f). 
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Fig. 1| Phase analysis of a year-long, oscillatory predator-prey time series 
(experiment Cl). a, Time series of the predator, B. calyciflorus (red) and its 
prey, Monoraphidium minutum (green); abundance normalized toa range of O 
to1.b, Phase portrait of the bandpass-filtered predator-prey time series 
between days 90 and 135; the arrow indicates the direction of oscillation. 

c, Wavelet coherence (WCO) between predator and prey. Values range from O 
(blue) to1 (red). The cone of influence is indicated by black thin curved lines. 
The black contour lines enclose significant areas (95% significance level, 

WCO > 0.83). The thick black line segments show the instantaneous oscillation 


Transitions between different oscillation regimes 


Although the experiments were not perturbed by external influences, 
we identified sharp transitions between two different dynamic regimes. 
For most of the time, we observed oscillations with a significantly coher- 
ent, nearly constant phase relationship (coherent oscillation regimes) 
(Methods), in which the predator densities followed the prey densities 
with a phase lag of about 6 = 0.5m (equivalent to 90°), consistent with 
classical predator-prey theory. These periods with coherent oscilla- 
tions were intersected by non-coherent oscillation regimes in which the 
well-defined phase relationship between the measured signals was lost, 
but re-initiated shortly after (Fig. 1c, e). For example, the time interval 
between days 100 and 131 in Cl (Fig. 2a, b) and between days 37 and 66 
in C7 (Fig. 2c, d) showed sustained cycles with regular period lengths 
and aconstant phase lag between prey and predator densities, whichis 
also reflected in the anticlockwise motion in the prey-predator phase 
plane (Fig. 1b). This regime of coherent oscillations gave way toa non- 
coherent, more irregular time series (days 132-161 in Cland days 67-97 
in C7), in which the phase relationship between prey and predator was 
lost, although both populations continued to oscillate (Fig. 2). Finally, 
and without external intervention, the system displayed resilience and 
the characteristic predator-prey phase difference was re-established. 

Combining the analyses of all five experiments with the same preda- 
tor and prey species (C1-C5), we found that coherent oscillations were 
the dominant type of dynamics and comprised 66% of the total experi- 
mental time with a typical regime duration of 58 days (Extended Data 
Fig. 4and Methods). Inthe remaining 34% of the experiments, predator 
and prey continued to oscillate, but without a well-defined phase rela- 
tionship (typical duration of non-coherent regimes, 23 days) (Methods). 
Phase analysis of surrogate data revealed that spurious coherent oscil- 
lation regimes can arise also by chance, but with a significantly smaller 
duration (typical duration, 15 days) (Extended Data Fig. 4). 


period S(t) of highest WCO within these areas (coherent oscillation regimes). 
d, Global wavelet power spectrum P(s) for the predator (red) and the prey 
(green). e, Phase difference 0(t) between predator and prey; well-defined 
relations between both signals are marked in red. f, Circular distribution of the 
mutual phase difference to prey (indicated in green) for the predator (red), the 
predator egg ratio (blue), the number of eggs (black) and the dead animals 
(yellow). c, d, The two horizontal lines indicate the prefixed period band of 

[s,, S.].a,c, e, The solid vertical lines indicate selected time spans of regular and 
irregular oscillations, which are enlarged in Fig. 2. 


We performed additional analyses and experiments that were 
designed to provide mechanistic insights into the origin of the observed 
coherent oscillations and the intermittent breaks in the phase signature. 
First, we extended the specificity of the phase analysis by including 
predator life-history stages. Second, we formulated and analysed a 
mathematical model that accounts for the predator’s stage structure. 
And third, we performed three additional chemostat experiments: two 
experiments with periodic changes in the nutrient concentration inthe 
supply medium, to explore how the phase relations were influenced 
by external forcing (C8 and C9); and one experiment at dilution rates 
that, according to bifurcation theory””’ and our mathematical model, 
should cause the predator-prey system to cross over from oscillatory 
to equilibrium dynamics (C10). 


Phase analysis of life-history stages 

The predator in our system, the rotifer Brachionus calyciflorus”’, is a 
small freshwater metazoan that reproduces asexually in the chemo- 
stats (only females occur) and undergoes a stage-structured life cycle. 
Reproducing females carry 1-5 eggs, from which juveniles hatch and 
grow to adulthood without any larval stages; adults die shortly after 
the last egg has hatched. In addition to analysing algal and rotifer densi- 
ties, we applied phase analysis to all discernible predator life-history 
stages: the number of eggs, the egg ratio (that is, the average number 
of eggs per female rotifer) and the number of dead animals. We found 
that these life-history parameters also exhibited persistent cycles that 
were coherently locked in phase with prey abundance (Fig. 3a). These 
phase differences were notably constant over all different experimental 
replicates (Extended Data Figs. 1-3). Averaging across all experiments 
(C1-C5), we obtained the following phase signature: the predator den- 
sity followed the prey density with a phase difference of 93° + 21°, the 
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Fig. 2| Resilience of phase relationship in experiments with two different 
prey species. a, Enlarged view of the densities of the predator (animals per 
millilitre, red) and prey (M. minutum, 10° cells per millilitre, green) in 
experiment Cl for atime window of 90 days. b, Enlarged view of their WCO 
during the same time interval (colours and lines as in Fig. 1c). a, b, Solid vertical 
lines border atime interval without significant wavelet coherence between 
predator and prey, separating three dynamic regimes: days 100-131, distinct 


abundance of eggs was delayed to the abundance of prey witha phase 
difference of 27° +19°, whereas the egg ratio preceded the prey signal 
(-27° + 16°); dead animals had a phase difference of -160° + 68°. These 
mutual phase differences can be compactly visualized in a single cir- 
cular histogram”! (Figs. 1f and 3b). This ‘phase signature’ enables the 
straightforward interpretation of the temporal succession of the com- 
munity during the cycles and thus provides a fingerprint for the cyclic 
community structure. 


Mathematical model 


We compared our experimental results with numerical simulations of 
a stage-structured predator-prey model that included the operative 
parameters of the chemostat (inflow nutrient concentration and dilu- 
tion rate) and essential organismal variables (algae as well as rotifer 
eggs, juveniles, adults and dead animals) (Extended Data Fig. 5 and 
Methods). Numerical simulations revealed sustained oscillations with 
a period length of 6.8 days (experimental mean, 6.7 days). When we 
introduced stochasticity into the model, we observed similar breaks 
in the oscillation regimes as in experiments C1-C7, with random tran- 
sitions between coherent and non-coherent oscillating regimes that 
were triggered by stochasticity alone. We were still able to recover a 
phase signature that aligned well with the experimental observations 
(predator, 90° + 11°; eggs, 11° + 13°; egg ratio, —-21° + 2°; dead rotifers, 
-150° + 60°) (Fig. 3c, d and Extended Data Fig. 5). 

Notably, both inthe data and the model, the predator egg ratio pre- 
ceded the phase of prey availability, which is not immediately obvious as 
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predator-prey cycles witha constant phase shift of 6 = 1/2; days 132-161, 
regime of non-coherent oscillations (grey background); days 162-190, 
spontaneous re-initiation of the regular phase-locked oscillation.c, d, Asin 

a, b, but showing a time window of 90 days for experiment C7 witha different 
algal prey, Chlorella vulgaris. Non-coherent oscillations arise between days 67 
and 97. 


egg productionis directly connected to food availability. This phase lag 
between prey density and egg ratio disappeared in model simulations 
when the juvenile maturation rate was set to zero (Extended Data Fig. 6), 
revealing the effect of juveniles on the phase difference signature. 
Juvenile predators (which, as immature individuals, carry no eggs by 
definition) have a crucial influence as they lead to an early breakdown 
of the egg-ratio maximum, caused by the multitude of non-egg-bearing 
juveniles during the prey maximum, finally giving rise to a negative 
phase difference between egg ratio and prey (Fig. 3a, c and Extended 
Data Fig. 6a). Thus, as physiological information may be encoded inthe 
phase difference distributions, extending phase analysis to life-history 
stages enabled deeper insights into the functioning of our system. 


Additional experiments 


In experiments C8 and C9, we externally forced the dynamics 
through pulsed supply of nutrients (Methods). We chose a forcing 
period (8.0 days) that was clearly distinguishable from the period 
of the unforced experiments (6.7 days) but sufficiently close to 
enable, in theory, predator and prey (thatis, living populations with 
demographic constraints) to lock to the external signal. Indeed, we 
observed predator-prey oscillations that were locked to the external 
signal, both in chemostat experiments and model results (Fig. 3e, g 
and Extended Data Fig. 7). As expected, oscillations were persistently 
coherent, without the non-coherent breaks in the time series from 
the unforced experiments (C1-C7), although intermittent period 
doubling occurred in experiment C8 (Fig. 3e and Extended Data 


b 0.51 


a 
® 
rs) 
= 
o 

5S 
= 
=) 
Te) 
<x 
10 15 20 25 30 35 

c 
® 
cs) 
Cc 
© 

ne) 
= 
5 
a 
~¢ 

e 
® 
cs) 
Cc 
@ 

ne) 
= 
SD 
Te) 
<x 
45 50 55 60 65 70 75 80 
9g 
® 
s) 
= 
io) 
ne) 
Cc 
5 
o 
< 


Time (days) 


Fig. 3 | Dynamics and phase relationships in unforced versus externally 
forced systems. a, Time series of algae (green), rotifers (red), eggs (black), egg 
ratio (blue) and dead animals (yellow) inan unforced (that is, ‘free-running’ 
under constant conditions) experiment covering five oscillations (days 10-35 
in C2). Data were filtered with a second order Butterworth bandpass filter and 
are shown in relative units. b, Phase signature aggregated over all experiments 
with the algal prey M. minutum under unforced conditions (C1-CS, including a 
total duration of 965 days), showing the circular distribution of the mutual 
phase differences to prey (colour code as ina).c, Simulated time series from 


Fig. 7c, d). Notably, externally forced and unforced experiments 
produced essentially the same phase signature (Fig. 3b, f), which 
was repeated also in model simulations (Fig. 3d, h). Thus, evenina 
dynamic regime that was strongly influenced by and locked to an 
external factor, phase analysis of different life-history stages was 
able to reveal the underlying predator-prey structure and to disen- 
tangle between intrinsic and external origins of cycles. This demon- 
strates that phase signature analysis can be a powerful diagnostic 
tool for the analysis of field data, for which it is often not known 
whether oscillations are the result of external factors or originate 
from species interactions. 

Finally, we confirmed that the coherent oscillations observed in 
our experimental system indeed represent stable limit cycles, which 
arise from the destabilization of a stable equilibrium point. Oscilla- 
tions in experiment C10 persisted only in a certain parameter range; 
if the dilution rate was increased, the oscillations died out (Extended 
Data Fig. 8). This result agrees with model predictions and previous 
experiments showing that increased dilution rates drive the system 
over a Hopf bifurcation’® (Extended Data Fig. 9). 


Conclusions 

Persistent and coherent predator-prey oscillations are a potential 
dynamic regime enabling the prolonged coexistence of predator and 
prey populations, as predicted by fundamental theory. In live systems, 
stochasticity and other external perturbations can obscure or prevent 
predator-prey oscillations, and our simulations suggest that stochas- 
ticity is probably responsible for the reversible shifts from coherent 


the predator-prey model under constant conditions with all stochasticity 
removed (mean centred, relative units). d, Phase signature obtained from 
stochastic simulations of the predator-prey model reproduces the 
experimentally observed phase relationships and variances inb.e, Asina, but 
for an experiment with external forcing (days 45-88 of C8). The concentration 
of the external medium is shown in magenta. f, Phase signature aggregated 
over all experiments with external forcing (C8-C9, including a total duration of 
323 days).g,h, Asinc, d, but for simulations in an externally forced system. 


to more-erratic oscillations that we observed in our highly controlled 
experiments. Four separate lines of evidence support a causal rela- 
tionship between sustained cycles and predator-prey interactions. 
(1) Cyclic patterns dominated throughout the entire experiment and 
the system displayed resilience—that is, it returned to a well-defined 
phase relationship after intermittent non-coherent periods. (2) Planned 
experimental interventions enabled us to suppress cycles (C10) and to 
quench intermittent non-coherent oscillations (C8 and C9). (3) Dataon 
the stage structure of the predator population enabled us to explicitly 
link characteristics of the predator-prey cycles to recurring changes in 
the demographic structure of the predator population. Therefore, our 
case for sustained predator-prey cycles is much stronger than itis pos- 
sible for unstructured microbial populations, in which population den- 
sities are the only variables. (4) Finally, the experimental predator-prey 
cycles manifested as aregular phase-locked succession, a pattern that 
we were able to capture as acommunity phase signature. Our approach 
firmly establishes this phase signature—which, in a similar way, can 
be constructed for any oscillating system—as a robust measure for 
identifying species interactions, cyclic or seasonal species succession, 
and the regulatory dependencies that underlie community dynamics. 


Online content 
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Methods 


Data reporting 

No statistical methods were used to predetermine sample size. The 
experiments were not randomized and the investigators were not 
blinded to allocation during experiments and outcome assessment. 


Experiments 

We ran chemostat experiments with parthenogenetic rotifers (B. 
calyciflorus sensu strictu”’, asmall metazoan freshwater zooplankton 
species) as predators and unicellular algae (M. minutum or C. vulgaris) 
as prey under constant temperature (23 °C) and permanent illumina- 
tion. Experiments were conducted (mostly consecutively) in the same 
well-maintained climate chamber. We carefully searched for possible 
sources of external perturbations, but all measured parameters (for 
example, temperature and irradiance) remained constant throughout 
the experimental runs. For inoculation, we used stock cultures origi- 
nally raised from a single individual and added B. calyciflorus 10 days 
after the algae had reached a biomass that enabled rotifer growth. The 
chemostats (experimental volume, 0.81) hada constant inflow of sterile 
medium witha rate of 0.55 per day. We used a modified WC medium”® 
with a reduced nitrogen concentration of 80 pmol I nitrate as the 
limiting resource. In the experiments with external forcing, we periodi- 
cally changed the concentration in the external medium to between 
0and 160 pmol!‘ nitrate with a period length of 8 days. We took daily 
subsamples of 8 ml to determine the abundance of both predator and 
prey. The algal abundance was analysed with an electronic particle 
counter (CASY, Scharfe). The rotifers were counted using an inverted 
microscope at 100-fold magnification. For the rotifers, we recorded 
the total number of rotifers and of asexually produced subitaneous 
eggs and the dead animals. No males or sexually reproducing females 
were observed throughout the experiment. The egg ratio (that is, the 
total number of eggs divided by the total number of animals in the 
population) is a measure of the reproductive status of the population. 


Bivariate wavelet analysis 

Because the measured signals showed large variability, we applied 
phase analysis®, which relies on the fact that the regulatory depend- 
ence between state variables is often encoded in their phase relation- 
ship, whereas the amplitudes may be highly erratic and uncorrelated®. 
The wavelet method” is a particularly powerful tool for extracting 
optimally resolved phase information from ecological time series>”° 
and enables us to quantify transient associations between two non- 
stationary signals*”? ?, 


Continuous wavelet transformation 

The continuous wavelet transformation™ (CWT) of a signal x(¢) is 
calculated as the convolution of the signal with a localized complex- 
valued wavelet function w(t) centred at time t and dilated by the scale 
parameters 


1. (t-t 
Wis,t)=—= fdexerw| 


Ss 


Here, the asterisk denotes the complex conjugate. After decompos- 
ingthecomplex CWT into amplitude and phaseW,(s, t) =|W,(s, tleiex60 
intuitively it can be understood as the extent |W,(s, 0)| to which the 
signal x(0) at local time t resembles an oscillation with period lengths 
and phase @,(s, 0). 

Throughout our analysis, we used the Morlet wavelet w(t)= 


10 '/4gi#ote-t’/2 with w, = 6 so that the scale parameter s can simply be 
interpreted as the period length of the oscillation™*. We always took s 
from 100 uniform steps per octave ona logarithmic scale. Edge effects 
were minimized by applying zero padding to the time series to the next 


highest integer power of two. We always indicate the cone of influence, 
which marks the zone in the CWT that is affected by edge effects”. 


Wavelet power spectrum 
Although the wavelet transform is complex, the real-valued local wavelet 
power spectrum (WPS) ofa signal x(t) at time tand scales canbe defined as 


W,..(S, t) = (W,(s,0) WA, t)) 


in which “...)’ denotes a smoothing operator in both scale and time (see 
‘Parametrization and smoothing’). The global WPS P,(s) is defined as 
the time average of the local WPS and measures the averaged variance 
of the signal x(t) at scale s. We used the maximum of P,(s) to estimate 
the mean oscillation period of the signal. 


Wavelet cross-spectrum 

To quantify the statistical relationship between two non-stationary 
signals x(t) and y(¢), the wavelet cross-spectrum (WCS) is defined as 
the smoothed product of the corresponding wavelet transforms”? 


W, (5,0) = (Wes, t) W5(s, t)) 


The WCS is complex-valued. After decomposition into amplitude 
and phaseW, ,(s, t) = A,., (s, t) e'“?» © the phase describes the de-lay 
A®@,, . (s,t)=@,(s, 0) - @,(s, Qbetween the two signals at time tand scale 
s, whereas the amplitude A,,,(s, t) expresses the covarying power of the 
two processes. 


Wavelet coherence 

The WCS is not specific, because it exhibits high values when either a 
true covariance between the two signals exists or when one of the spec- 
tra exhibits a high value. To solve this problem, anormalized measure 
is given by the WCO”? *!, which is defined as the amplitude of the WCS 
divided by the square roots of the two local WPS 


W,.,(s,0)| 


WCO,, ,(s, t) = 
xy W,..AS, t)¥? Wy y(S, t)¥? 


The WCO estimates the relationship between the two signals at time 
tand scales, normalized into the range 0 < WCO <1. As the WCOis real- 
valued, it cannot be used to extract the delay between two signals, in 
contrast to the WCS. 


Dominant phase difference 

The phase difference A@, ,(s, t) between two signals x(t) and y(¢), as 
obtained from the WCS, depends on the scale parameter s. Assume 
now that the two signals oscillate with acommon, but possibly time- 
varying, period length. To obtain their dominant phase difference, we 
define for every time instance ¢t the dominant scale S(t) as the scale 
parameter that has the maximal WCO at time ¢ over all scales in a pre- 
fixed band [s,, s,]: 


$(t) = argmax WCO(s, f) 


S1<S<Sy 

These values S(t) constitute a line of the strongest co-oscillating 
component of both signals in the timescale plane”. Inserting this line 
in the WCS yields the dominant phase difference 6,.,(t) between the 
two signals at every time instance 


9, y(t) = G, (5(0),0 - SO, 0) 


Significance testing 
Toassess the statistical significance of the WCO, we generated n=1,000 
independent realizations of surrogate data by time-scrambling the 
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empirical signals (that is, sampling 1,000 random values with replace- 
ment from the measured predator and prey time series). The resulting 
surrogate data retain the frequency histogram of the original data 
but lose all temporal correlations. Using Monte Carlo simulations, we 
found that for our chosen parameterization (see ‘Parametrization and 
smoothing’) two signals are significantly correlated at time t and scale 
sona 95% critical level if WCO(s, f) =0.83. 


Coherent oscillation regime 
We defined a maximal contiguous time interval for which the dominant 
scale $(t)is within a statistically significant area in the timescale plane— 
that is, being inside the cone of influence and having a significant 
WCO(S(0), t) > 0.83—as a coherent oscillation regime. The remaining 
time intervals constitute the non-coherent oscillation regimes. Each 
signal on its own may well oscillate in a regular fashion even within a 
non-coherent oscillation regime, but for a pair of signals the wavelet 
analysis guarantees a significantly correlated oscillation—and therefore 
awell-defined phase relationship—only ina coherent oscillation regime. 
To characterize the typical duration? of coherent oscillations inan 
ensemble of signals, we first measure the length 7, of each coherent 
oscillation regime i=1... Nthat occurs in the ensemble and then cal- 
culate the expected oscillation regime duration, 7, ofarandomly cho- 
sen time instance taken from a coherent oscillation regime 
7 =>",7?/d",7;. We found that this measure gives a better 
description of the typical time to observe a coherent oscillation inter- 
val than the median or the meanT = ¥™, 7;/Nof the length distribution. 


Circular phase distribution 

To characterize the phase relationship between a pair of covarying 
signals, we calculated the dominant phase differences 0, for all time 
instances within a coherent oscillation regime. We used circular sta- 
tistics? to characterize the resulting distribution of phase differences 
6,. For this we calculated the complex order parameter z, which is 


defined as the average of the complex numbers elk 
wenn tq Me & 
z= Rel? =— > el 
wp, 


Here the phase angle 6 of zindicates the circular mean of the phase 
differences, and the amplitude R of z gives the circular standard devi- 
ation of the distribution as /-2 InR. 

Given the sampling frequency of 1 day, the Nyquist frequency is 
0.5 day“, which for an average period length of 6.7 days roughly cor- 
responds toa phase resolution of 27°. Averaging phase differences 
over many cycles enabled us to achieve a much better phase resolution, 
yielding typically rather smooth distributions of phase differences. 
We verified that these smooth phase distributions are no artefacts, 
by comparing phase distributions obtained from our highly resolved 
numerical simulations (see ‘Numerical simulation’) with those obtained 
from time series sampled from these simulations once per day, which 
yielded very similar results. 


Phase signature 

Tocharacterize the mutual phase relationship of not onlytwo but also an 
ensemble of time series (for example, total densities of predator and prey 
and of measured life history stages), we calculated the distribution of 
phase differences of all time series with respect to one selected reference 
time series. The combined plot of all resulting circular distributions in 
onecommon polar histogram defines our ‘phase signature’, providinga 
fingerprint for the cyclicsuccession of the corresponding state variables. 


Parametrization and smoothing 
Following a previously published study”, we performed smoothing in 
time by convolution with a normalized real Morlet wavelet 


exp[-17/(2s”)]/(s/211). Smoothing in scale direction was obtained by 
averaging over aconstant-length (boxcar) window over 0.6 octaves—the 
scale decorrelation length for the Morlet wavelet—on a logarithmic 
scale™. Similarly, we took the upper and lower scale band borders s, 
and s, as + 0.6 octaves from the peak in the global (that is, time-aver- 
aged) WCS. For a better visualization, circular histograms (bin width of 
1/100) were smoothed by a moving average (window width 0.1511, the 
above calculated phase resolution). 

We verified that the resulting phase distributions remain largely 
unchanged for different parameterizations of the wavelet analysis, 
in particular for different wavelet functions, different values of the 
wavelet parameter wp, different smoothing parameters and scale band 
borders s, ands,. 


Numerical simulation 
We developed a mathematical model to describe a stage-structured 
predator-prey community in a chemostat**”®, which closely follows 
our experimental setup. The model includes as state variables the 
concentration of nitrogen N, phytoplankton P, and three life stages 
of the predator, namely eggs F, juveniles/ and adults A. Additionally 
the model keeps track of the dead animals D in the chemostat. We 
assume a constant time delay 6 for egg development, a constant 
time delay t for juvenile maturation, and the same constant mor- 
tality rate for juveniles and adults (deliberately keeping the model 
simple, as intake-related regulations***’ did not improve the agree- 
ment between simulation results and data). We assume that the phy- 
toplankton population is unstructured. All variables are measured 
inumol NI". 

The model is written as the following system of time-delayed ordinary 
differential equations: 


N= 6N,,- Fp(N)P- ON 
P=F,(N)P- Fy(P)B/e - 6P 
E=R,-R,- 6E 

J =R)-Ry- (m+ 6 
A=BR,-(m+6)A 
D=m(J+A)-6D 


Here, £ is the adult/juvenile mass ratio (the juvenile/egg mass ratio 
is assumed to be 1) and the recruitment rates are 


R,(t) = F(P(0) A(t) 
R(t) =R,(t- 8) exp(-66) 
R,(t) = R(t- 7) exp(-6r) 


Algal nutrient uptake F,(N) is modelled as a Monod function and 
predator recruitment as a type-3 functional response with Hill coef- 
ficient x 


N 
ag are 


pk 
FP) = BKK pk 


The total predator density is given by B(d) = B/(t) + A(o) and the egg 
ratio is BE(0)/B(O. As state variables are given in units of nitrogen, juve- 
niles and eggs here need to be scaled by the adult/juvenile mass ratio B. 

The model is solved numerically by an Euler scheme with a time 
step of h= 0.001 days. We included environmental stochasticity by 
taking the growth rates r, and r, as autocorrelated random num- 
bers by multiplying them in every time step i with a factor of 1+ s,. 


Autocorrelation (that is, red noise) is implemented as s;,, = as; + §; 
(first-order autoregressive AR(1) process*’) with autocorrelation 
parameter a and independent and identically distributed Gaussian 
numbers € with mean 0 and variance o*(1 —a’). For wavelet analysis 
and for representation in figures, the simulated states are sampled 
witha rate of 1 per day, in accordance with the experiments. To cap- 
ture measurement error and additional stochastic influences, we 
replaced the sampled rotifer state variables, F(t),J(0), A(t) and D(0), 
by random values drawn from a Poisson distribution with an expecta- 
tion value equal to that of the state variable. Finally, algal and rotifer 
state variables were rescaled to individuals per volume using the 
constants Vp, and vz, respectively. 

Parameter values were taken as follows. Mass ratio adult/juve- 
niles, 8 = 5; dilution rate, 6 = 0.55 per day; nitrogen concentration 
in the external medium, N,, = 80 pmol I; phytoplankton maximal 
growth rate, rp =3.3 per day; phytoplankton half-saturation constant: 
kp=4.3 pmol I; rotifer maximal egg-recruitment rate, r, = 2.25 per 
day; rotifer half-saturation constant, k, = 15 pmol I"; Hill coefficient 
of the functional response, x = 1.25; predator assimilation efficiency, 
€=0.25; rotifer mortality rate, m=0.15 per day; egg development time, 
0= 0.6 days; juvenile maturation time, T= 1.8 days; autocorrelation 
parameter, a=0.9; noise strength, o= 0.5; nitrogen content per adult 
female Brachionus, v,= 0.57 x 10 mol N per female; nitrogen content 
per algal cell, v»=28 x 10° pmol N per cell. 

Environmental parameters (6, N,,,) were taken according to the 
experimental set-up. Parameters for uptake and growth (rp, Kp, rg, Kg 
and €) were taken according toa previous study”. The rotifer mortality 
rate (m) was increased compared with this previous study”, as we did 
not include the loss of the reproducing rotifer cohort into senescent 
animals. The mass ratio of adults to juveniles (8) was taken from the 
literature®’. Duration of development stages (0, T) was adjusted to be 
within the previously reported range*®. We used a type-3 functional 
response*”*°. Only the parameter values of population dynamics 
(9, T, K and m) and stochasticity (noise levels and noise colour) were 
adjusted by hand within ecologically reasonable ranges to simultane- 
ously optimally fit the comprehensive experimental data. As criteria 
we used the observed range and variability of the state variables, 
existence of oscillatory dynamics with correct period length, phase 
signatures (that is, phase relationships between all state variables, 
including the egg ratio), power and wavelet spectra, and the typical 
duration of coherent oscillatory regimes—both in the unforced and 
forced scenarios. 


Reporting summary 


Further information on research design is available in the Nature 
Research Reporting Summary linked to this article. 


Data availability 


Experimental data are publicly available on Figshare (https://doi. 
org/10.6084/m9.figshare.10045976.v1). 


Code availability 


Data analysis and the model were implemented in the language Julia 
v.1.1". The source code for the wavelet analysis is publicly available on 
GitHub (https://github.com/berndblasius/WaveletAnalysis). 
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Extended Data Fig. 1| Detailed dynamics and phase relationship in 
experiment C1. a, Time series of the predator, B. calyciflorus (red), andits prey, 
M. minutum (green), (normalized to the range of 0 and 1). b, Phase portrait of 
the bandpass-filtered predator-prey time series between days 90 and 130. The 
arrow indicates the direction of oscillation. c, Local WPS for the prey, ranging 
from 0 (blue) to maximum (red), and the instantaneous oscillation period S(t) 
(black line) of the highest WCO within the prefixed period length band 
(horizontal black lines). The cone of influence is also indicated (black). 
d, Global WPS P(s) for the prey (green). Horizontal lines asinc.e,f, Asinc,d, but 
for the predator. g, WCS between predator and prey, colour-coding and other 
lines asinc. h, Amplitude of the global WCS for the predator and the prey. 


1.57 


i, WCO between predator and prey, ranging from O (blue) to1 (red). Significant 
areas (WCO > 0.83) are enclosed by thin solid lines; the thick black line 
segments show the instantaneous oscillation period $(t)in these segments. 

j, Histogram of period lengths within coherent oscillation regions as detected 
ini. k, Phase difference 6(¢) to prey for the predator (red), the predator egg ratio 
(blue), the number of eggs (black) and dead animals (yellow). Significant 
relations between respective signals are marked as a solid line. I, Circular 
distribution of the mutual phase difference to prey (indicated in green) for the 
predator (red), the predator egg ratio (blue), the number of eggs (black) and 
dead animals (yellow). 
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Extended Data Fig. 2| Dynamics and phase relationships in three further Global WPS. e-h, As ina-—d, for experiment C3. i-I, Asina-d, for experiment C4. 
experimental time series in a constant environment. a-d, Analysis of Details and coloursas in Fig. 1. 


experiment C2. a, Predator-prey time series. b, Phase signature.c, WCO.d, 
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Extended Data Fig. 3 | Dynamics and phase relationships in measured time series with a different algal species (C. vulgaris). a—d, Analysis of experiment Cé6.a, 
Predator-prey time series. b, Phase signature.c, WCO.d, Global WPS. e-h, As ina-d, for experiment C7. Details and colours as in Fig. 1. 
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Extended Data Fig. 4 | Dynamics and phase relationship in surrogate data 
and length distribution of coherent oscillation regimes. a-f, As in Fig. 1, but 
for surrogate data, obtained by time scrambling—that is, randomly drawing 
360 values with replacement from the predator and prey time series in 
experiment Cl—yielding atime series without temporal correlation (white 
noise) but with the same abundance distribution as the experimental data. The 
figure shows atypically outcome. Occasionally, for short bursts the 
randomized predator and prey time series oscillate with acommon frequency 
(a), yielding areas of high coherence (c; WCO > 0.83; spurious coherent 
oscillation regimes) with random phase differences (e). Whereas most time 
intervals with coherent predator-prey cycles inthe surrogate data havea short 
duration, inthis example (by chance) there is one regime (days 187-209) witha 
duration of 23 days. g, h, Box plots of oscillation regime length. Boxes show the 
interquartile range, orange lines indicate the median, whiskers range from the 
lowest to the highest data point within 1.5 the interquartile range and markers 
indicate outliers. g, Length distribution of coherent oscillation regimes for the 
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experimental data under free-running conditions (C1-CS5) and for alarge 
ensemble of surrogate data. This shows that time intervals with coherent 
oscillations are significantly longer in the experiments (n =20; median, 16 days; 
mean +s.d.,29 +28 days) than in the surrogate data (n = 13,719; median, 8 days; 
mean ts.d.,9+7 days). The two outliers in the left box plot correspond tothe 
two long coherent oscillation regimes of 111 days and 85 days in C1, which (even 
though they are outliers in the length distribution) effectively dominate the 
dynamic behaviour in Cl. Consequently, we calculate the typical duration T of 
coherent oscillations as the expected regime duration for arandomly chosen 
time instance taken froma coherent oscillation regime (Methods) (T= 58 days 
in C1-C5;7 =15 days in the surrogate data). h, Asing, but for durations of non- 
coherent oscillation regimes, showing that time intervals of non-coherent 
oscillations are significantly shorter in the experiments (n =15; median, 17 days; 
mean +s.d.,16 +11 days; typical duration 7 =23 days) than in the surrogate data 
(n=12,719; median, 43 days, mean+s.d.,60+11 days; typical duration 

T=113 days). 
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Extended Data Fig. 5 | Detailed dynamics and phase relationshipsina 
modelled time series. a—l, As in Extended Data Fig. 1, but for asimulated time 
series of the stochastic time-delayed model ina constant environment. The 
simulation time was 300 days. A typical model outcome is shown. In this 
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example, we obtained five coherent oscillation regimes (indicated as thick 
lines ini, k). Similar to the experimental systems (C1-C7), these regimes are 
interspersed by short non-coherent oscillation regimes without a significant 
phase relationship. 
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Extended Data Fig. 6 | Dynamics and phase relationships in a model without 
juvenile maturation delay. a, Simulated time series of algae (green), rotifers 
(red), eggs (black), egg ratio (blue) and dead animals (yellow) ina model 
without juvenile maturation delay, t= 0, under constant conditions with all 
stochasticity removed (mean centred, relative units). b, Phase signature 
obtained from stochastic simulations (free-running conditions, r=0).c,d, Asin 
a,b, but for an externally driven system (external nutrient concentration 
shown in magenta). The figure shows that ina scenario without juvenile 
maturation delay (r= 0), the phase of the egg ratio coincides with that of the 


algae, bothina free-running and in an externally driven model. This isin 
contrast to the experimental observations (Fig. 3, rows land 3) and simulations 
with non-vanishing juvenile maturation delay (t=1.8 days) (Fig. 3 rows 2and 4, 
Extended Data Fig. 9d, f), in which the egg ratio is significantly preceding the 
algae signal. Thus, the phase signature helps to disentangle life-history 
mechanisms, as—by comparison of observed and simulated phase 
signatures—a delay of juvenile maturation is essential to attain the 
experimentally observed phase relations. 
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Extended Data Fig. 7| Dynamics and phase relationshipsinexternallydriven as inFig.1.Ina,e,i, the blue dotted line shows the input concentration inthe 
systems. a-—d, Analysis of experiment C8.a, Predator-preytimeseries.b,Phase external mediuminnormalized units. Inb, f,j, the phase difference distribution 
signature.c, WCO.d, Global WPS. e-h, As ina-—d, for experiment C9. i-l, Asin between the external medium and the prey is shown in magenta. 

a-d, for simulation results of an externally driven system. Details and colours 


abundance 


s 
£ 
ie) 
c 
G 
no} 
S 
© 
eo 10 20 30 40 50 
time ind 


abundance 


60 70 80 0 


i 
p= 
Da 
[= 
a) 
SS 
fs) 
o 
Gg 25 50 75 100 
time ind 


abundance 


125 


= 

= 

te 

io) 

Cc 

a) 

ne} 

S 

© = ‘ 

=O 25 50 75 100 

time ind 


Extended Data Fig. 8 | Dynamics and phase relationships in experiments and 
simulations with changed dilution rate and press perturbations. a—d, 
Analysis of experiment CS, which has an increased dilution rate (6= 0.66 per 
day). a, Predator-prey time series. b, Phase signature. c, WCO.d, Global WPS. 
e-h, Asina-d, for experiment C10, which undergoes press perturbations 
(vertical black lines) at day 84 (dilution rate was increased from 5 = 0.55 per day 
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to 6=1.2 per day) and day 123 (dilution rate increased further to 6=1.35 per 
day). i-I, As ina-d, with simulation results of asystem undergoing a press 
perturbation in which the dilution rate was increased to 6=0.75 per day at day 
100 (vertical black line ini, k). Under conditions of the increased dilution rate, 
the oscillations are suppressed (Extended Data Fig. 9a). Details and colours as 
in Fig. 1. 
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Extended Data Fig. 9 | Bifurcation diagrams. a, Simulated values of predator 
(red) and prey (green) for different values of the dilution rate. Thick solid lines 
show maximal and minimal values obtained in the unforced model without 
stochasticity. Shaded areas show range of values in the stochastic model. 
Sustained predator-prey oscillations appear for 0.47 <6<0.71per day. For 
large values of the dilution rate, 6 > 0.87 per day, the predator goes extinct. b, 
As ina, but for the externally forced system. The model shows oscillations for 
all parameters without predator extinction 5 < 0.8 per day, it exhibits a 
bistability regime (small- versus large-amplitude oscillations) for 

0.23 <6&< 0.31 per day anda period-2 oscillation regime for 0.42 <6 < 0.57 per 
day.c, d, Bifurcation diagrams of the unforced model asa function of the 
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juvenile maturation delay t.c, Asina, but for the different bifurcation 
parameter. d, Phase difference to the algae for the predator (red), eggs (black) 
and the egg ratio (blue). The shaded areas indicate the range of phase 
differences (+1s.d.) from the experiments in a constant environment (C1-C5). 
For large values of r, the simulated phase signature aligns with the 
experimental data, whereas for small values of t, the simulated phase 
difference of the egg ratio approaches zero. e, f, Asinc, d, but for the externally 
forced model. e, For small delay times, 0.4 <1t<1day, the model exhibits a 
chaotic regime. f, Additionally the phase difference between algae and external 
nutrients is shown (magenta). Vertical solid lines show the actually used 
parameter values 6=0.55 per day and r=1.8 days. 


Extended Data Table 1| Summary of all experiments 


Code Experimental conditions | Algal species Duration (days) | N; (mol I’) 6 (day”) 

C1 constant environment M. minutum 373 80 0.55 

C2 constant environment M. minutum 189 80 0.55 

C3 constant environment M. minutum 181 80 0.55 

c4 constant environment M. minutum 137 80 0.55 

C5 constant environment M. minutum 85 80 0.66 

C6 constant environment C. vulgaris 230 160 0.55 

C7 constant environment C. vulgaris 233 160 0.55 

C8 externally forced M. minutum 183 0/160 0.55 

c9 externally forced M. minutum 140 0/160 0.55 

C10 press perturbation M. minutum 197 80 0.55/1.2/1.33 
A series of ten chemostat experiments was performed, constituting a total of 1,948 measurement days (corresponding to 5.3 years of measurement) and covering 3 scenarios. (1) Constant 
environmental conditions (C1-C7, 1,428 measurement days). This scenario consisted of 4 trials with the alga M. minutum (C1-C4), 1 trial with M. minutum and an increased dilution rate of 6 = 0.66 
per day (C5), and 2 trials with the alga C. vulgaris and a doubled concentration of the external medium N,, = 160 umol t (C6 and C7). (2) Externally forced conditions (C8 and C9, 323 measure- 


ment days). The concentration of the external medium was periodically changed between N,,, = 160 umoll" and N,, = O umol l" with a period length of 8 days. (3) Press perturbation (C10, 197 


measurement days). The dilution rate was increased from the standard value 6 = 0.55 per day in two steps to 6 = 1.2 per day at day 84 and 6 = 1.33 per day at day 123, suppressing the predator- 


prey oscillations. 
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For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 
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The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


a The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 
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A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 
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Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection Measured data were first transfered to Excel-files (Versions 9-14) and subsequently archieved as csv-files. 


Data analysis Data analysis and the model were implemented in the language Julia 1.1 
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- Adescription of any restrictions on data availability 


Experimental data were made publicly available on Figshare — doi.org/10.6084/m9.figshare.10045976.v1. Data analysis and the model were implemented in the 
language Julia 1.1 (ref. 41). The source code for the wavelet analysis was made publicly available on https.//github.com/berndblasius/WaveletAnalysis. 
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Ecological, evolutionary & environmental sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Study description 


Research sample 


Sampling strategy 


Data collection 


Timing and spatial scale 


Data exclusions 


Reproducibility 


Randomization 


Blinding 


The aim of the study was to generate and analyse experimental long term data. Thus, the experimental design differs from the typical 
multi-)factorial design. From our time series, the relevant parameters were extracted. We had four replicates for the core long term 
experiments. Treatment factors were nutrient supply (constant vs. pulsed), type of algae (Monoraphidium vs Chlorella) and flow rate 
shift from standard flow rate to high rates). These were run in duplicate or triplicate, which is common for such long-term 
experimental studies (see Extended Data Table 1). Each run represents an individual independent chemostat vessel. Each vessel 
contained 800 ml culture medium. All experiments were run under stable conditions at 23 °C and permanent illumination. 


The experimental algae were Monoraphidium minutum (SAG-243-1, Gdttingen, Germany) and Chlorella vulgaris (UTEX 26, Texas, 
USA). Stock cultures were kept under non-limiting conditions to avoid synchronization of the cells. Animals were Brachionus 
calyciflorus, strain IGB, grown in stock cultures with the experimental algae under the same conditions. All species are cosmopolitan 
species occuring in various types of freshwaters. During the experiments, a daily sample of 8 ml was taken, representing 1 % of the 
total experimental volume. This sample size is large enough to quantify the number of algae and animals (see below) and small 
enough in order not to change the flow rate. 


The sample volume was constant throughout the study and chosen according to the expected number of algae/rotifers within the 
sample. Each experimental vessel contained up to several thousands of rotifers and up to several billions of algae. Since turbulent 
mixing prevents spatial heterogeneity both species are randomly distributed. A sample of 8 ml was taken (5 ml for the analysis of 
animals, 3 ml for the analysis of algae). The maximum abundance of the animals is higher than 100/ml so that a 5 ml sample is 
representative. As for the algae, the abundance is much higher (mio/ml), a smaller sub-sample was analysed using an electronic 
particle counter (CASY, Scharfe, Germany). Three sub-samples of 400uI were analysed and the mean was calculated. The coefficient 
of variation was on average 0.015. 


Data were collected daily, mostly by E. Denzin, or other staff members, guided by G. Weithoff and G. Fussmann. Each person who 
participated in sampling was individually trained to avoid sampling bias. The same holds for the quantitative analysis of algae and 
rotifers. The raw data from the electronic particle counter were archived on a computer and also transferred to Excel-files (Versions 
9 - 14). The data from the microscopic quantification of the animals were first recorded on paper in a laboratory folder and then 
transferred to Excel-files (Versions 9 - 14). The correct transfer was doublechecked in order to avoid any error through the transfer 
process. From these files, csv-files were created and used further onwards. 


The first experiments started in April 2001 and lasted until May 2009; data acquisition (counting) and analyses extended until 2019. 
Depending on human and spatial resources up to four chemostats were run simultaneously, often only two. The sampling frequency 
of daily samples was chosen in accordance to the growth rate of the experimental organisms. Daily samples allow for capturing the 
dynamic population behavior of the organisms and the subsequent analysis of population oscillations. 


Experimental runs were discarded from the data set when technical problems (leakage of tubes, blackout of pumps or illumination 
etc.) occurred before the data set was sufficiently long to extract the key parameters for this study. 


The reproducibility is part of the study, demonstrating that the findings were recurrent in several independent experimental runs. 


Treatments were set up independently and placed randomly in the climate chamber or different experimental runs, which were run 
subsequently, were run at the same spot. 


Blinding was neither possible, nor necessary. The correct type of medium, and flow rate had to be visible to the experimentalists. 
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Childhood malnutrition is associated with high morbidity and mortality globally’. 
Undernourished children are more likely to experience cognitive, physical, and 
metabolic developmental impairments that can lead to later cardiovascular disease, 


reduced intellectual ability and school attainment, and reduced economic 
productivity in adulthood’. Child growth failure (CGF), expressed as stunting, 
wasting, and underweight in children under five years of age (0-59 months), isa 
specific subset of undernutrition characterized by insufficient height or weight 
against age-specific growth reference standards? °. The prevalence of stunting, 
wasting, or underweight in children under five is the proportion of children witha 
height-for-age, weight-for-height, or weight-for-age z-score, respectively, that is more 
than two standard deviations below the World Health Organization’s median growth 
reference standards for a healthy population®. Subnational estimates of CGF report 
substantial heterogeneity within countries, but are available primarily at the first 
administrative level (for example, states or provinces)’; the uneven geographical 
distribution of CGF has motivated further calls for assessments that can match the 
local scale of many public health programmes®. Building from our previous work 
mapping CGF in Africa’, here we provide the first, to our knowledge, mapped high- 
spatial-resolution estimates of CGF indicators from 2000 to 2017 across 105 low- and 
middle-income countries (LMICs), where 99% of affected children live’, aggregated to 
policy-relevant first and second (for example, districts or counties) administrative- 
level units and national levels. Despite remarkable declines over the study period, 
many LMICs remain far from the ambitious World Health Organization Global 


Nutrition Targets to reduce stunting by 40% and wasting to less than 5% by 2025. Large 
disparities in prevalence and progress exist across and within countries; our maps 
identify high-prevalence areas even within nations otherwise succeeding in reducing 
overall CGF prevalence. By highlighting where the highest-need populations reside, 
these geospatial estimates can support policy-makers in planning interventions that 
are adapted locally and in efficiently directing resources towards reducing CGF and its 
health implications. 


Despite improvements in nearly all LMICs, stunting remained the 
most widespread and prevalent indicator of CGF throughout the study 
period. Overall, estimated childhood stunting prevalence across LMICs 
decreased from 36.9% (95% uncertainty interval, 32.8-41.4%) in 2000 
to 26.6% (21.5-32.4%) in 2017. Progress was particularly noticeable in 
Central America and the Caribbean, Andean South America, North 
Africa, and East Asia regions, and in some coastal central and western 
sub-Saharan African (SSA) countries, where most areas with estimated 
stunting prevalence of at least 50% in 2000 had reduced to 30% or less 
by 2017 (Fig. 1a, b). By 2017, zones with the highest prevalence of stunt- 
ing primarily persisted throughout much of the SSA, Central and South 
Asia, and Oceania regions, where large areas had estimated levels of 
at least 40%, such as in the first administrative-level units of Nigeria’s 
Jigawa state (60.6% (51.5-69.7%)), Burundi’s Karuzi province (60.0% 


(51.4-67.5%)), India’s Uttar Pradesh state (49.0% (48.5-49.5%)), and 
Laos’s Houaphan province (58.3% (50.7-66.8%)) (Extended Data Fig. 1). 
In 2017, Guatemala (47.0% (40.2-54.6%)), Niger (47.5% (42.2-53.9%)), 
Burundi (54.2% (46.3-61.2%)), Madagascar (49.8% (43.2-57.2%)), Timor- 
Leste (49.8% (43.4-56.2%)), and Yemen (45.4% (38.8-51.5%)) had the 
highest national-level stunting prevalence. 

Even within the aforementioned regions where reductions were most 
evident, local-level estimates revealed communities in which levels 
still approached those seen in SSA and South Asia; areas in southern 
Mexico and central Ecuador had estimated stunting prevalence of at 
least 40%, and areas in western Mongolia reached at least 30%. Wide 
within-country disparities were apparent in several instances, indicat- 
ing large areas left behind by the general pace of progress that require 
attention (Fig. 1a, b). Although most countries successfully reduced 


*A list of participants and their affiliations appears in the online version of the paper. 
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Fig. 1| Prevalence of stunting in children under five in LMICs (2000-2017) 
and progress towards 2025. a, b, Prevalence of stunting in children under five 
at the 5 x 5-km resolution in 2000 (a) and 2017 (b). c, Overlapping population- 
weighted tenth and ninetieth percentiles (lowest and highest) of 5 x 5-kmgrid 
cellsand AROC in stunting, 2000-2017. d, Overlapping population-weighted 
quartiles of stunting prevalence and relative 95% uncertainty in 2017. 


stunting prevalence, subnational inequalities (disparities between 
second administrative-level units (henceforth ‘units’)) remained 
widespread globally—especially evident in Vietnam, Honduras, Nigeria, 
and India (Extended Data Fig. 2). Among the top quintile of widest dis- 
parities, Indonesia experienced a twofold difference in stunting levels 
in 2017, ranging from 21.0% (16.2-27.0%) in Kota Yogyakarta regency 
(Yogyakarta province) to 51.5% (40.6-62.3%) in Sumba Barat regency 
(Nusa Tenggara Timur province). Stunting levels varied fourfold in 
Nigeria, ranging from 14.7% (9.1-21.0%) in Surulere Local Government 
Area (Lagos state) to 64.2% (54.2-74.6%) in Gagarawa Local Government 
Area (Jigawa state) in 2017. 

Evaluated from estimates of population-weighted prevalence for 
areas with the highest and lowest estimated prevalence of stunting 
(ninetieth and tenth percentiles, respectively), locations in central 
Chad, Pakistan, and Afghanistan, in northeastern Angola, and through- 
out the Democratic Republic of the Congo and Madagascar had among 
the lowest annualized rates of change (AROC), indicating stagnation or 
increase over the study period (Fig. 1c); in 2017, these countries also had 
large geographical areas among the most highly prevalent for stunting. 
By contrast, areas scattered throughout Peru, northwestern Mexico, 
and eastern Nepal had among the highest stunting levels in 2000, but 
also the highest rates of decline; by 2017, many of these areas were 
subsequently no longer in the highest-prevalence decile. 

The absolute number of children under five who were stunted 
was also unequally distributed (Fig. le, f), with a large proportion 
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e, f, Number of children under five who were stunted, at the 5 x 5-km (e) and 
first-administrative-unit (f) levels. g, 2000-2017 annualized decrease in 
stunting prevalence relative to rates needed during 2017-2025 to meet the 
WHO GNT.h, Grid-cell-level predicted stunting prevalence in 2025. Maps were 
produced using ArcGIS Desktop 10.6. Interactive visualization tools are 
available at https://vizhub.healthdata.org/Ibd/cgf. 


concentrated in a few nations in 2017; overall, 85.1% (84.4-85.7%) 
of all stunted children under five lived in Africa or Asia. Of the 
176.1 million (151.6-203.3 million) children who were stunted in 2017, 
just over half (50.1% (48.5-52.0%)) lived in only four countries: India 
(51.5 million (47.7-55.3 million) children; 28.6% (27.1-30.4%) of global 
stunting), Pakistan (10.7 million (9.3-12.1 million); 6.8% (6.7-6.9%)), 
Nigeria (11.8 million (10.7-13.0 million); 6.6% (6.4-6.8%)), and China 
(16.2 million (14.0-18.5 million); 9.0% (9.1-8.9%)). Although China 
had a low prevalence of national stunting (10.8% (9.1-12.6%)) in 
2017, the prevalence was high in India (39.3% (39.1-39.6%)), Pakistan 
(44.0% (38.4-49.9%)), and Nigeria (38.2% (34.5-42.0%)). Even with mod- 
erate levels of stunting (10 to <20%)"°, these highly populous coun- 
tries would substantially contribute to the global share owing to their 
population size, and reducing their levels would markedly decrease 
the number of stunted children. 

Childhood wasting was less widespread than stunting (Fig. 2a, b), 
affecting 8.4% (7.9-9.9%) of children under five in LMICs in 2000, and 
6.4% (4.9-7.9%) by 2017. Wasting reached critical levels (at least 15%)" 
nationally in 13 LMICs in 2000 and 7 LMICs in 2017, although only 
in Mauritania (20.7% (16.5-25.6%)) did all units exceed these levels 
(Extended Data Fig. 3). Critical wasting prevalence was concentrated in 
few areas across the globe in 2017, including the peri-Sahelian areas of 
countries stretching from Mauritania to Sudan, as wellas areas in South 
Sudan, Ethiopia, Kenya, Somalia, Yemen, India, Pakistan, Bhutan, and 
Indonesia. Most LMICs reduced within-country disparities between 
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Fig. 2| Prevalence of wasting in children under five in LMICs (2000-2017) 
and progress towards 2025. a, b, Prevalence of child wasting in children under 
five at the 5 x 5-km resolution in 2000 (a) and 2017 (b).c, Overlapping 
population-weighted tenth and ninetieth percentiles (lowest and highest) of 
5x 5-km grid cells and AROC in wasting, 2000-2017. d, Overlapping 
population-weighted quartiles of wasting prevalence and relative 95% 


their highest- and lowest-prevalence units between 2000 and 2017, 
most notably in Algeria, Uzbekistan, and Egypt (Extended Data Fig. 4). 
Even against a backdrop of national-level declines, however, broad 
within-country disparities in wasting remained in countries such as 
Indonesia, Ethiopia, Nigeria, and Kenya. An estimated ninefold dif- 
ference in wasting prevalence occurred among Kenya’s units in 2017, 
ranging from 2.9% (1.6-4.9%) in Tetu constituency (Nyeri county) to 
28.3% (20.2-37.3%) in Turkana East constituency (Turkana county); 
higher-resolution estimates reveal areas with a wasting prevalence of 
at least 25%. High-prevalence areas in 2000 typically remained within 
the highest population-weighted decile for wasting in 2017, including 
the units of Rabkona county (Unity state) in northern South Sudan 
(27.8% (19.8-37.6%) in 2000; 17.3% (8.8-21.9%) in 2017), the Tanout 
department (Zinder region) in southern Niger (21.6% (17.3-26.7%) 
in 2000; 16.5% (11.3-23.3%) in 2017), and Alor regency (Nusa Tenggara 
Timur province) in southeastern Indonesia (16.4% (9.6-25.8%) in 2000; 
20.7% (12.8-30.3%) in 2017) (Fig. 2c). 

The absolute number of children affected by wasting was unequal 
both across and within countries (Fig. 2e, f). Of the 58.3 million (47.6- 
70.7 million) children affected by wasting in 2017, 57.1% (52.7-61.6%) 
occurred in four of the most populous countries: India (26.1 million 
(23.1-29.0 million); 44.7% (41.0-48.6%) of global wasting), Pakistan 
(3.5 million (2.8-4.3 million); 6.0% (5.8-6.1%)), Bangladesh (1.8 mil- 
lion (1.2-2.4 million); 3.0% (2.6-3.4%)), and Indonesia (2.0 million 
(1.7-2.3 million); 3.4% (3.3-3.5%)). On the basis of standard thresholds", 


uncertainty in 2017. e, f, Number of children under five affected by wasting, at 
the 5x 5-km (e) and first-administrative-unit (f) levels. g, 2000-2017 annualized 
decrease in wasting prevalence relative to rates needed during 2017-2025 to 
meet the WHO GNT.h, Grid-cell-level predicted wasting prevalence in 2025. 
Maps were produced using ArcGIS Desktop 10.6. Interactive visualization tools 
are available at https://vizhub.healthdata.org/Ibd/cgf. 


these countries had serious levels of national wasting prevalence 
(10 to <15%), ranging from 12.2% (9.7-14.9%) in Pakistan to 15.7% (15.5- 
15.9%) in India, and all but Bangladesh had areas with estimated wasting 
levels above 20%; increased efforts, especially in densely populated 
areas with high prevalence and absolute numbers, could immensely 
reduce global child wasting. 

The prevalence of underweight—a composite indicator of stunting 
and wasting—followed the scattered pattern of high-stunting areas in 
SSA and spanning Central Asia to Oceania, and the high prevalence belt 
of wasting along the African Sahel (Extended Data Fig. 5a, b). Affecting 
19.8% (17.3-22.7%) of children under five across LMICs in 2000 and 
13.0% (10.4-16.0%) in 2017, reductions in underweight prevalence were 
most notable for countries in Central and South America, southern 
SSA, North Africa, and Southeast Asia. For example, by 2017, estimated 
underweight prevalence had decreased to less than or equal to 20% 
for nearly all areas in Namibia. By contrast, peri-Sahelian countries 
stretching from Mauritania to Somalia maintained an estimated under- 
weight prevalence of at least 30% in many areas. Large geographical 
areas across Central and South Asia also maintained high prevalence 
of underweight during the study period; in particular, India, Pakistan, 
and Bangladesh sustained estimated prevalence of at least 30% in most 
locations. Although levels of child underweight had largely reduced 
since 2000, within-country disparities remained widespread; 71.4% 
(75 out of 105) of LMICs experienced at least a twofold difference across 
units in 2017 (Extended Data Fig. 6). 
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Prospects for reaching 2025 targets 


We estimate that broad areas across Central America and the Caribbean, 
South America, North Africa, and East Asia had high probability (+95%) 
of having already achieved targets for both stunting and wasting in 
2017 (Extended Data Fig. 7). Exceptions to these regional patterns exist; 
areas with stagnated progress and less than 50% probability of having 
achieved the World Health Organization’s Global Nutrition Targets 
for 2025 (WHO GNTs) in 2017 were found throughout much of Gua- 
temala and Ecuador for stunting and in southern Venezuela for wast- 
ing (Figs. 1g, 2g, Extended Data Fig. 7). Even within countries that had 
achieved targets, there remain areas with slow progress; locations in 
central Peru for stunting and southwestern South Africa for wasting 
had not achieved targets in 2017 (less than 5% probability)—nuances 
otherwise hidden by aggregated estimates. Owing to stagnation or 
increases in prevalence, broad areas in SSA and substantial portions 
across Central Asia, South Asia, and Oceania (for example, inthe Demo- 
cratic Republic of the Congo and Pakistan for stunting; in Yemen and 
Indonesia for wasting) require reversal of trends or acceleration of 
declines in order to meet international targets (Figs. 1g, 2g). 

Despite predicted improvements in AROC for 2017-2025, many 
highly affected countries are predicted to have areas that maintain 
estimated stunting levels of at least 40% or wasting levels of at least 
15% in 2025 (Figs. 1h, 2h). Accounting for uncertainty in 2000-2017 
AROC estimates, and with 2010 national-level estimates as a baseline 
for the 40% stunting reduction target, 44.8% (47 out of 105) of LMICs 
are estimated to nationally meet WHO GNT (>95% probability) for stunt- 
ing by 2025 (Supplementary Table 13). At finer scales, 17.1% (n = 18) 
and 7.6% (n= 8) of LMICs will meet the stunting target in all first and 
second administrative-level units in 2025, respectively (Extended Data 
Fig. 8a,d, Supplementary Table 13). Similarly, 35.2% (n = 37) of LMICs are 
estimated to reduce to or maintain less than 5% wasting prevalence by 
2025 (>95% probability) based on current trajectories (Supplementary 
Table 13). Fewer countries were estimated to meet wasting targets in all 
first administrative-level (16.2% (n=17)) or second administrative-level 
(9.5% (n=10)) units (Extended Data Fig. 8b, e, Supplementary Table 13). 
Only 26.7% (n = 28) of LMICs will meet national-level targets for both 
stunting and wasting by 2025, and only 4.8% (n = 5) will achieve both 
targets in all units (Supplementary Table 13). 


Discussion 


Although commendable declines in CGF have occurred globally, this 
progress measured at a coarse scale conceals subnational and local 
underachievement and variation in achieving the WHO GNTs. Sup- 
porting conclusions inthe Global Nutrition Report”, our results show 
that most LMICs will not reach WHO GNTs nationally, and even fewer 
will meet targets across subnational units. Our mapped results show 
broad heterogeneity across areas, and reveal hotspots of persistent CGF 
even within well-performing regions and countries, where increased 
and targeted efforts are needed. In 2017, one in four children under 
five across LMICs still suffered at least one dimension of CGF, and the 
largest numbers of affected children were often in specific within- 
country locations. Although the national prevalence of CGF was gener- 
ally lower in Central America and the Caribbean, South American, and 
East Asian countries, there are communities in these regions in which 
levels of CGF remain as high as those in SSA and South Asia. Regardless 
of overall declines, many subnational areas across LMICs maintained 
high levels of CGF and require substantial acceleration of progress or 
reversal of increasing trends to meet nutrition targets and leave no 
populations behind. 
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To our knowledge, this study is the first to estimate CGF compre- 
hensively across LMICs ata fine geospatial scale, providing a precision 
public health tool to support efficient targeting of local-level interven- 
tions to vulnerable populations. Although densely populated areas 
may have relatively low prevalence of CGF, the absolute number of 
affected children may still be high; thus, both relative and absolute 
estimates are important to determine where additional attention is 
needed. To achieve international goals, more concerted efforts are 
needed in areas with decreasing or stagnating trends, without dimin- 
ishing support in areas that demonstrate progress nor contributing to 
increases in obesity. In future work, we plan to determine how tostratify 
our estimates of CGF by sex and age, assess the double burden of child 
undernutrition and overweight, analyse important maternal indicators 
that affect child nutritional status outcomes (such as anaemia), and 
continue to monitor progress towards the 2025 WHO GNTs. These 
mapped estimates enable decision-makers to visualize and compare 
subnational CGF and nutritional inequalities, and identify populations 
most in need of interventions”. 
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Methods 


Overview 

Building from our previous study of CGF in Africa’, we used Bayesian 
model-based geostatistics*—which leveraged geo-referenced sur- 
vey data and environmental and socioeconomic covariates, and the 
assumption that points with similar covariate patterns and that are 
closer to one another in space and time would be expected to have 
similar patterns of CGF—to produce high-spatial-resolution estimates 
of the prevalence of stunting, wasting, and underweight among chil- 
dren under five across LMICs. Stunting, wasting, and underweight were 
defined as z-scores that were two or more standard deviations below 
the WHO healthy population reference median for length/height-for- 
age, weight-for-length/height, and weight-for-age, respectively, for 
age- and sex-specific curves*. Using an ensemble modelling framework 
that feeds into a Bayesian generalized linear model with a correlated 
space-time error, and 1,000 draws from the fitted posterior distribu- 
tion, we generated estimates of annual prevalence for each indicator of 
CGF ona5x5-km grid over 105 LMICs for each year from 2000 to 2017 
and mapped results at administrative levels to provide relevant sub- 
national information for policy planning and public health action. For 
this analysis, we compiled an extensive geo-positioned dataset, using 
data from 460 household surveys and reports representing 4.6 mil- 
lion children. To ensure comparability with national estimates and to 
facilitate benchmarking, these local-level estimates were calibrated 
to those produced by the Global Burden of Disease (GBD) Study 2017’, 
and were subsequently aggregated to the first administrative level (for 
example, states or provinces) and second administrative level (for 
example, districts or departments) in each LMIC. We also predict CGF 
prevalence for 2025 based on 2000-2017 trajectories and estimate 
the AROC required to meet the WHO GNTs by 2025. In addition, we 
estimate the 2017 absolute numbers of children under five affected by 
each CGF indicator in LMICs based on our prevalence estimates and 
the size of the populations of children under five*"*. Furthermore, we 
provide figures that demonstrate subnational disparities between each 
country’s second administrative-level units with the highest and lowest 
estimated prevalence for 2000 and 2017 (Extended Data Figs. 2, 4, 6). 
We re-estimate CGF prevalence for the 51 African countries included 
in our previous analysis’ using 28 additional surveys, and extend time 
trends to model each year from 2000 to 2017. Owing to these improve- 
ments in data availability and methodology, the estimates provided 
here supersede our previous modelling efforts. 

Countries were selected for inclusion in this study using the socio- 
demographic index (SDI)—a summary measure of development that 
combines education, fertility, and poverty, published in the GBD study’. 
The analyses reported here include countries in the low, low-middle, 
and middle SDI quintiles, with several exceptions (Supplementary 
Table 3). China, Iran, Libya, and Malaysia were included despite high- 
middle SDI status in order to create better geographical continuity. 
Albania and Moldova were excluded owing to geographical disconti- 
nuity with other included countries and lack of available survey data. 
We did not estimate for the island nations of American Samoa, Feder- 
ated States of Micronesia, Fiji, Kiribati, Marshall Islands, North Korea, 
Samoa, Solomon Islands, or Tonga, where no available survey data 
could be sourced. The flowchart of our modelling process is provided 
in Extended Data Fig. 9. 


Surveys and child anthropometry data 

We extracted individual-level height, weight, and age data for children 
under five from household survey series including the Demographic 
and Health Surveys (DHS), Multiple Indicator Cluster Surveys (MICS), 
Living Standards Measurement Study (LSMS), and Core Welfare Indica- 
tors Questionnaire (CWIQ), among other country-specific child health 
and nutrition surveys”? (Supplementary Tables 4, 5). Included in our 
models were 460 geo-referenced household surveys and reports from 


105 countries representing approximately 4.6 million children under 
five. Each individual child record was associated witha cluster, agroup 
of neighbouring households or a ‘village’ that acts as a primary sampling 
unit. Some surveys included geographical coordinates or precise place 
names for each cluster within that survey (138,938 clusters for stunt- 
ing, 144,460 for wasting, and 147,624 for underweight). Inthe absence 
of geographical coordinates for each cluster, we assigned data to the 
smallest available administrative areal unit inthe survey (termed a ‘poly- 
gon’) while correcting for the survey sample design (16,554 polygons 
for stunting, 18,833 for wasting, and 19,564 for underweight). Boundary 
information for these administrative units was obtained as shapefiles 
either directly from the surveys or by matching to shapefiles in the 
Global Administrative Unit Layers (GAUL)” or the Database of Global 
Administrative Areas (GADM)”. In select cases, shapefiles provided by 
the survey administrator were used, or custom shapefiles were created 
based on survey documentation. These areal data were resampled to 
point locations using a population-weighted sampling approach over 
the relevant areal unit with the number of locations set proportionally 
to the number of grid cells in the area and the total weights of all the 
resampled points summing to one”®. 

Select data sources were excluded for the following reasons: miss- 
ing survey weights for areal data, missing sex variable, insufficient 
age granularity (in months) for calculations of length/height-for-age 
z-scores and weight-for-age z-scores in children ages 0-2 years, incom- 
plete sampling (for example, only children ages O-3 years measured), 
or untrustworthy data (as determined by the survey administrator 
or by inspection). We excluded data for children for whom we could 
not compute age in both months and weeks. Children with height val- 
ues <O cm or 2180 cm, and/or with weight values <0 kg or >45 kg were 
also excluded from the study. We also excluded data that were con- 
sidered outliers according to the 2006 WHO Child Growth Standards 
recommended range values, which were values <—6 or >6 length/height- 
for-age z-score for stunting, <—5 or >5 weight-for-length/height z-score 
for wasting, and <-6 or >5 weight-for-age z-score for underweight**. 
Details on the survey data excluded for each country are provided in 
Supplementary Table 6. Data availability plots for all the CGF indicators 
by country, type, and year are included in Supplementary Figs. 2-16. 


Child anthropometry 

Using the height, weight, age, and sex data for each individual, height- 
for-age, weight-for-height, and weight-for-age z-scores were calculated 
using the age-, sex-, and indicator-specific LMS (lambda-mu-sigma) 
values from the 2006 WHO Child Growth Standards**. The LMS meth- 
odology allows for Gaussian z-score calculations and comparisons 
to be applied to skewed, non-Gaussian distributions”. We classified 
stunting, wasting, or underweight ifthe height/length-for-age, weight- 
for-height/length, or weight-for-age, respectively, was more than two 
standard deviations (z-scores) below the WHO growth reference popu- 
lation’. These individual-level data observations were then collapsed to 
cluster-level totals for the number of children sampled and total num- 
ber of children under five affected by stunting, wasting, or underweight. 


Temporal resolution 

We estimated the prevalence of stunting, wasting, and underweight 
annually from 2000 to 2017 using a model that allows us to account for 
data points measured across survey years. As such, the model would 
also allow us to predict at monthly or finer temporal resolutions; how- 
ever, we are limited both computationally and by the temporal resolu- 
tion of the covariates. 


Seasonality adjustment 

Owing to the acute nature of wasting and its relative temporal transi- 
ence, wasting data were pre-processed to account for seasonality within 
each year of observation. Across LMICs, large proportions of the popu- 
lation live in rural areas and have livelihoods that rely on agriculture 


and livestock. Seasonality affects the availability of and access to food, 
sometimes owing to natural disasters or climate events (for example, 
floods, monsoons, or droughts) that vary by season. Generalized addi- 
tive models were fit to wasting data across time using the month of 
interview and acountry-level fixed effect as the explanatory variables, 
and the wasting z-score as the response. A 12-month periodic spline for 
the interview month was used, as well as a spline that smoothed across 
the whole duration of the dataset. Once the models were fit, individual 
weight-for-height/length z-score observations were adjusted so that 
each measurement was consistent with a day that represented amean 
day in the periodic spline. The seasonality adjustment had relatively 
little effect on the raw data’. 


Spatial covariates 
To leverage strength from locations with observations to the entire 
spatiotemporal domain, we compiled several 5 x 5-km raster layers of 
possible socioeconomic and environmental correlates of CGF in the 
105LMICs (Supplementary Table 7, Supplementary Fig. 17). Covariates 
were selected based on their potential to be predictive for the set of 
CGF indicators, after reviewing literature on evidence and plausible 
hypotheses as to their influence. Acquisition of temporally dynamic 
datasets, where possible, was prioritized to best match our observa- 
tions and thus predict the changing dynamics of the CGF indicators. 
Of the twelve covariates included, eight were temporally dynamic and 
were reformatted as a synoptic mean over each estimation period 
or asa mid-period year estimate: these covariates included average 
daily mean rainfall (precipitation), average daily mean temperature, 
enhanced vegetation index, fertility, malaria incidence, educational 
attainment in women of reproductive age (15-49 years old), popula- 
tion, and urbanicity. The remaining four covariate layers were static 
throughout the study period and were applied uniformly across all 
modelling years; growing season length, irrigation, nutritional yield for 
vitamin A, and travel time to nearest settlement of >50,000 inhabitants. 
Toselect covariates and capture possible nonlinear effects and com- 
plex interactions between them, an ensemble covariate modelling 
method was implemented”. For each region, three sub-models were 
fit to our dataset using all of our covariate data as explanatory predic- 
tors; these sub-models were: generalized additive models, boosted 
regression trees, and lasso regression. Each sub-model was fit using 
fivefold cross-validation to avoid overfitting, and the out-of-sample 
predictions from across the five holdouts were compiled into a single 
comprehensive set of predictions from that model. In addition, the 
same sub-models were run using 100% of the data, and a full set of 
in-sample predictions were created. The three sets of out-of-sample 
sub-model predictions were fed into the full geostatistical model” as 
the explanatory covariates when performing the model fit. The in- 
sample predictions from the sub-models were used as the covariates 
when generating predictions using the fitted full geostatistical model. 
Arecent study demonstrated that this ensemble approach can improve 
predictive validity by up to 25% over an individual model”’. 


Geostatistical model analysis 

Binomial count data were modelled within a Bayesian hierarchical 
modelling framework using a logit link function and a spatially and 
temporally explicit hierarchical generalized linear regression model to 
fit prevalence of each of our indicators in 14 regions™ of LMICs (North 
Africa, western SSA, central SSA, eastern SSA, southern SSA, Middle 
East, Central Asia, East Asia, South Asia, Southeast Asia, Oceania, Cen- 
tral America and the Caribbean, Andean South America, and Tropical 
South America; see Extended Data Fig. 10). For each region, we explicitly 
wrote the hierarchy that defines our Bayesian model. 

For each binomial CGF indicator, we modelled the average number 
of children with stunting, wasting, or who were underweight in each 
survey cluster, d. Survey clusters are precisely located by their GPS 
coordinates and year of observation, which we map to a spatial raster 


location, i, at time, t. We observed the number of children reported to 
be stunted, wasted, or underweight, respectively, as binomial count 
data, C,,among an observed sample size, N,. AS we may have observed 
several data clusters within a given location, i, at time, t, we refer to 
the probability of stunting, wasting, or underweight, p, withina given 
cluster, d, by its indexed location, i, and time, ¢, as Pia),ua)- 
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For indices d, i, and t, *(index) is the value of * at that index. The prob- 
abilities, p;,, represent both the annual prevalence at the space-time 
location and the probability that an individual child was afflicted with 
the risk factor given that they lived at that particular location. The 
annual prevalence, p;,, of each indicator was modelled as alinear com- 
bination of the three sub-models (generalized additive model, boosted 
regression trees, and lasso regression), rasterized covariate values, X;,,, 
acorrelated spatiotemporal error term, Z,,,and country randomeffects, 
Ect, With one unstructured country random effect fit for each country 
inthe modelling region and all €,,, sharing acommon variance param- 
eter, y’, and anindependent nugget effect, ¢,,, with variance parameter, 
o°. Coefficients in B, in the three sub-models h = 1, 2,3 represent their 
respective predictive weighting in the mean logit link, while the joint 
error term, Z,,, accounts for residual spatiotemporal autocorrelation 
between individual data points that remains after accounting for the 
predictive effect of the sub-model covariates, the country-level random 
effect, €.,,), and the nugget independent error term, ¢;,. The residuals, 
Z,, are modelled as a three-dimensional Gaussian process (GP) inspace- 
time centred at zero and with a covariance matrix constructed froma 
Kronecker product of spatial and temporal covariance kernels. The 
spatial covariance, 2°", is modelled using an isotropic and stationary 
Matérn function”, and temporal covariance, 2", as an annual autore- 
gressive (AR1) function over the 18 years represented in the model. In 
the stationary Matérn function, I is the gamma function, K, is the 
modified Bessel function of order v> 0, x > Ois ascaling parameter, 
D denotes the Euclidean distance, and w’ is the marginal variance. The 
scaling parameter, x, is defined to be x= (8/6 in which dis a range 
parameter (which is about the distance where the covariance function 
approaches 0.1) and visascaling constant, whichis set to 2 rather than 
fit from the data”*””. This parameter is difficult to reliably fit, as docu- 
mented by many other analyses”°”5” that set this to 2. The number of 
rows and thenumber of columns of the spatial Matérn covariance matrix 
are both equal to the number of spatial mesh points for a given model- 
ling region. Inthe AR1 function, pis the autocorrelation function (ACF), 
and kand are points in the time series where |k — j| defines the lag. The 
number of rows and the number of columns of the AR1 covariance matrix 
are both equal to the number of temporal mesh points (18). The number 
of rows and the number of columns of the space-time covariance 
matrix, 2°?*°° @ x", for a given modelling region are both equal to: (the 
number of spatial mesh points x the number of temporal mesh points). 
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This approach leveraged the residual correlation structure of the 
datato more accurately predict prevalence estimates for locations with 
no data, while also propagating the dependence in the data through 
to uncertainty estimates“. The posterior distributions were fit using 
computationally efficient and accurate approximations in R-INLA*” 
(integrated nested Laplace approximation) with the stochastic partial 
differential equations (SPDE)” approximation to the Gaussian pro- 
cess residuals using R project v.3.5.1. The SPDE approach using INLA 
has been demonstrated elsewhere, including the estimation of health 
indicators, particulate air matter, and population age structure’ ®. 
Uncertainty intervals were generated from 1,000 draws (that is, sta- 
tistically plausible candidate maps)” created from the posterior-esti- 
mated distributions of modelled parameters. Further details on model 
and estimation processes are provided in the Supplementary Infor- 
mation. 


Post estimation 

To leverage national-level data included in the 2017 GBD study! that 
were not within the scope of our current geospatial modelling frame- 
work, and to ensure alignment between these estimates and GBD 
national-level and subnational estimates, we performed a post hoc 
calibration to the mean of the 1,000 draws. We calculated popula- 
tion-weighted aggregations to the GBD estimate level, which was 
either at the national or first administrative level, and compared 
these estimates to our corresponding year estimates from 2000 to 
2017. We defined the calibration factor to be the ratio between the 
GBD estimates and our current estimates for each year from 2000 to 
2017. For some selected countries where GBD estimates were at the 
first administrative level, the calibration factors were also calculated 
at the lowest available subnational level. These countries included 
Brazil, China, Ethiopia, India, Indonesia, Iran, Mexico, and South 
Africa. Finally, we multiplied each of our estimates in a country-year 
(or first-administrative-year) by its associated factor. This ensures 
consistency between our geospatial estimates and those of the 2017 
GBD’, while preserving our estimated within-country geospatial and 
temporal variation. To transform grid-cell-level estimates into arange 
of information useful to a wide constituency of potential users, these 
estimates were aggregated at first and second administrative-level 
units specific to each country and at national levels using conditional 
simulation”. 

Although the models can predict all locations covered by available 
raster covariates, all final model outputs for which land cover was clas- 
sified as ‘barren or sparsely vegetated’ on the basis of the most recently 
available Moderate Resolution Imaging Spectroradiometer (MODIS) 
satellite data (2013) were masked’®. Areas where the total population 
density was less than ten individuals per 1 x 1-km grid cell were also 
masked in the final outputs. 


Model validation 

We assessed the predictive performance of the models using fivefold 
out-of-sample cross-validation strategies and found that our preva- 
lence estimates closely matched the survey data. To offer a more 
stringent analysis by respecting some of the spatial correlation in the 
data, holdout sets were created by combining sets of data at different 
spatial resolutions (for example, first administrative level). Validation 
was performed by calculating bias (mean error), variance (root mean 
square error), 95% data coverage within prediction intervals, and cor- 
relation between observed data and predictions. All validation metrics 
were calculated on the out-of-sample predictions from the fivefold 
cross-validation. Furthermore, measures of spatial and temporal auto- 
correlation pre- and post-modelling were examined to verify correct 
recognition, fitting, and accounting for the complex spatiotemporal 
correlation structure in the data. All validation procedures and cor- 
responding results are included in Supplementary Tables 14-22 and 
Supplementary Figs. 24-41. 


Projections 
To compare our estimated rates of improvement in CGF prevalence 
over the last 18 years with the improvements needed between 2017 
and 2025 to meet WHO GNTs, we performed asimple projection using 
estimated annualized rates of change (AROC) applied to the final year 
of our estimates. 

For each CGF indicator, u, we calculated AROC at each grid cell, m, by 
calculating the AROC between each pair of adjacent years, ¢: 


D 
AROC,, m,¢ = logit] "— 
u,m,t-1 


We then calculated a weighted AROC for each indicator by taking 
a weighted average across the years, where more recent AROCs were 
given more weight in the average. We defined the weights to be: 


W, = (t-2000 +1)” 


inwhich y may be chosen to give varying amounts of weight across the 
years. For any indicator, we then calculated the average AROC to be: 


2017 
AROC,, m= out D> Wx AROC, mc 
2001 


Finally, we calculated the projections, Proj, by applying the AROC in 
our 2017 mean prevalence estimates to produce estimates in 8 years 
from 2017 to 2025. For this set of projections, we selected y =1.7 for 
stunting, y=1.9 for wasting, and y= 1.8 for underweight’. 


Proj = logit ‘(logit(y, m 2017) FAROC,,m * 8) 


u,m,2025 

This projection scheme is analogous to the methods used in the 2017 
GBD measurement of progress and projected attainment of health- 
related Sustainable Development Goals’. Our projections are based 
on the assumption that areas will sustain the current AROC, and the 
precision is dependent on the level of uncertainty emanating fromthe 
estimation of annual prevalence. 

Although the WHO GNT for wasting was to reduce prevalence to 
less than 5%, the WHO GNT for stunting was a 40% relative reduction 
in prevalence. For our analyses, we defined the WHO GNT for stunt- 
ing and underweight (for which no WHO GNT was established) to be 
40% reduction relative to 2010, the year the World Health Assembly 
requested the development of the WHO GNTs”’. 


Limitations 

The accuracy of our models depends on the volume, representative- 
ness, quality, and validity of surveys available for analysis (Supple- 
mentary Tables 4, 5, Supplementary Figs. 2-16). Persistent data gaps 
in national surveys include a lack of CGF data or household-level char- 
acteristics, such as hygiene and sanitation practices. The associated 
uncertainties of our estimates are higher in areas where dataare either 
missing or less reliable (Figs. 1d, 2d, Extended Data Fig. 5d), and rely 
more heavily on covariates and borrowing from neighbouring areas for 
their modelling (Supplementary Table 7, Supplementary Fig. 17). Invest- 
ments in improvements of health surveillance systems and including 
child anthropometrics as part of routine data collection for profiling 
population characteristics could improve the certainty of our estimates 
and better monitor progress towards international goals. In addition, 
measurementerror in collecting anthropometric information, includ- 
ing the child’s age, height, and weight, could have introduced bias or 
error in the data across different survey types. The accuracy of age 
data may be affected by differences in sampling approaches and self- 
reporting bias, suchas long recall period or selective recall. Weight and 
height measurements may be inaccurate owing to improper calibration 


of equipment, device inaccuracy, different measurement methods, or 
human error. We did not include a survey random effect to account for 
between-survey variability in data accuracy; given that most surveys 
represent a country-year, it would be difficult to distinguish these biases 
from temporal effects. Our calibration approachin the post-estimation 
process used only aratio estimator and did not account for an additive 
effect, which may have introduced bias. Owing to the complexity of the 
boosted regression tree sub-model, we were unable to account for the 
uncertainty of our three sub-models in our final estimates (see Supple- 
mentary Information section 3.2.2 for more detail). It is worth noting 
that our analyses are descriptive and do not support causal inferences 
ontheir own. Future research is required to determine the causal path- 
ways for each CGF indicator across and within LMICs. 


Reporting summary 


Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


CGF estimates can be further explored at various spatial scales 
(national, administrative, and local levels) through our customized 
online data visualization tools (https://vizhub.healthdata.org/Ibd/ 
cgf). The full output of the analyses and the underlying data used in 
the analyses are publicly available via the Global Health Data Exchange 
(GHDx; http://ghdx.healthdata.org/record/ihme-data/Imic-child- 
growth-failure-geospatial-estimates-2000-2017). Some data sources 
are under special licenses for the current study and are thus not publicly 
available. Supplementary Tables 4 and 5 show the incorporated data 
sources, and data with restrictions are marked with an obelisk symbol 
(t). All maps presented in this study are generated by the authors and 
no permissions are required to publish them. 

The findings of this study are supported by data available in public 
online repositories, data publicly available upon request of the data pro- 
vider, and data not publicly available owing to restrictions by the data 
provider. Non-publicly available data were used under license for the 
current study but may be available from the authors upon reasonable 
request and with permission of the data provider. Detailed tables and 
figures of datasources and availability can be found in Supplementary 
Tables 4, 5, and Supplementary Figs. 2-16. 

Administrative boundaries were retrieved from the Global Admin- 
istrative Unit Layers (GAUL)” or the Database of Global Administra- 
tive Areas (GADM)”.. Land cover was retrieved from the online Data 
Pool, courtesy of the NASA EOSDIS Land Processes Distributed Active 
Archive Center (LP DAAC), USGS/Earth Resources Observation and Sci- 
ence (EROS) Center, Sioux Falls, South Dakota*®. Lakes were retrieved 
from the Global Lakes and Wetlands Database (GLWD), courtesy of 
the World Wildlife Fund and the Center for Environmental Systems 
Research, University of Kassel“’. Populations were retrieved from 
WorldPop’”®. All maps in this study were produced using ArcGIS 
Desktop 10.6. 


Code availability 


Our study follows the Guidelines for Accurate and Transparent Health 
Estimate Reporting (GATHER; Supplementary Table 1). All code used 
for these analyses is publicly available online http://ghdx.healthdata. 
org/record/ihme-data/Imic-child-growth-failure-geospatial-estimates- 
2000-2017 and at http://github.com/ihmeuw/Ibd/tree/cgf-Imic-2019. 
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Extended Data Fig. 1| Prevalence of stunting in children under five in LMICs 
at administrative levels 0, 1,2, and at 5 x 5-kmresolution in 2017. 
Administrative level 0 are national-level estimates; administrative level lare 
first administrative-level (for example, states or provinces) estimates; 
administrative level 2 are second administrative-level (for example, districts or 


departments) estimates. Maps reflect administrative boundaries, 

land cover, lakes, and population; grey-coloured grid cells had fewer than ten 
people per 1x 1-km grid cell and were classified as ‘barren or sparsely 
vegetated’>16707140-42, or were not included in these analyses. Maps were 
produced using ArcGIS Desktop 10.6. 


All Stage 2 countries ranked by median prevalence across admin2 units in 2017 
Highest and lowest admin2 units shown as error bars; 2000 prevalence levels shown in grey for comparison 
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Extended Data Fig. 2 | Geographical inequality in the prevalence of child each country. Bars indicating the range in 2017 are coloured according to the 
stunting across 105 countries. The bars represent the range of stunting regions defined by the Global Burden of Disease (GBD)'. Grey bars indicate the 


prevalence in children under five in the second administrative-level units in range in 2000. The graph was produced using R project v.3.5.1. 
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Extended Data Fig. 3 | Prevalence of wasting in children under five in LMICs grid cells had fewer than ten people per 1x 1-km grid cell and were classified as 
at administrative levels 0, 1,2, and at 5 x 5-kmresolution in 2017. ‘barren or sparsely vegetated’>*?02140-42, or were not included in these 
Administrative levels are as described in Extended Data Fig. 1. Maps reflect analyses. Maps were produced using ArcGIS Desktop 10.6. 


administrative boundaries, land cover, lakes, and population; grey-coloured 


All Stage 2 countries ranked by median prevalence across admin2 units in 2017 
Highest and lowest admin? units shown as error bars; 2000 prevalence levels shown in grey for comparison 
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Extended Data Fig. 5 | Prevalence of underweight in children under five in 
LMICs (2000-2017) and progress towards 2025. a,b, Prevalence of 
underweight in children under five at the 5 x 5-km resolution in 2000 (a) and 
2017 (b).c, Overlapping population-weighted tenth and ninetieth percentiles 
(lowest and highest) of 5 x 5-km grid cells and AROC in underweight, 2000- 


2017. d, Overlapping population-weighted quartiles of underweight 
prevalence and relative 95% uncertainty in 2017. e, f, Number of underweight 
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children under five, at the 5 x 5-km (e) and first-administrative-unit (f) levels. 
g, 2000-2017 annualized decrease in underweight prevalence relative to rates 
needed during 2017-2025 to meet WHO GNT. h, Grid-cell-level predicted 
underweight prevalence in 2025. Maps were produced using ArcGIS Desktop 
10.6. Interactive visualization tools are available at https://vizhub.healthdata. 
org/Ibd/cgf. 
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Extended Data Fig. 7 | Probability that WHO GNT had been achieved in 2017 probability, respectively, of having met the WHO GNT in 2017. Given that there 
at the first administrative and 5 x 5-kmgrid-cell levels for stunting, wasting, | wasnoWHOGNTestablished for underweight, we based the underweight 
and underweight. a-f, Probability of WHOGNT achievementin2017 atthefirst | targeton WHOGNT for stunting, as the conditions are similarly widespread 
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Extended Data Fig. 8| Probability of meeting WHO GNT in 2025 at the first 
administrative and 5 x 5-km grid-cell levels for stunting, wasting, and 
underweight. a-f, Probability of WHO GNT achievement in 2025 at the first 
administrative and 5 x 5-km levels for stunting (a, d), wasting (b, e), and 
underweight (c, f). Dark-blue and dark-red grid cells indicate >95% and <5% 


probability, respectively, of meeting WHO GNT in 2025. Given that there was no 
WHO GNTestablished for underweight, we based the underweight target on 
WHOGNT for stunting as the conditions are similarly widespread and 
prevalent. Maps were produced using ArcGIS Desktop 10.6. 
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Extended Data Fig. 9 | Flowchart of CGF prevalence modelling process. The 
process used to produce CGF prevalence estimates in LMICs involved three 
main parts. In the data-processing steps (green), data were identified, 
extracted, and prepared for use in the models. In the modelling phase (red), we 
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spatiotemporal Gaussian process models for each CGF indicator. In post- 
processing (blue), we calibrated the prevalence estimates to match 2017 GBD 
study! estimates and aggregated the estimates to the first- and second- 
administrative-level units in each country. 
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Educational attainment is an important social determinant of maternal, newborn, and 
child health’ ?. As a tool for promoting gender equity, it has gained increasing traction 
in popular media, international aid strategies, and global agenda-setting* °®. The 


global health agenda is increasingly focused on evidence of precision public health, 
which illustrates the subnational distribution of disease and illness”®; however, an 
agenda focused on future equity must integrate comparable evidence onthe 
distribution of social determinants of health? ". Here we expand on the available 
precision SDG evidence by estimating the subnational distribution of educational 
attainment, including the proportions of individuals who have completed key levels 
of schooling, across all low- and middle-income countries from 2000 to 2017. Previous 
analyses have focused on geographical disparities in average attainment across Africa 
or for specific countries, but—to our knowledge—no analysis has examined the 
subnational proportions of individuals who completed specific levels of education 
across all low- and middle-income countries” “. By geolocating subnational data for 
more than 184 million person-years across 528 data sources, we precisely identify 
inequalities across geography as well as within populations. 


Education, as a social determinant of health, is closely linked to several 
facets of the Sustainable Development Goals (SDGs) of the United 
Nations. In addition to the explicit focus of SDG 4 on educational attain- 
ment, improved gender equality (SDG 5) and maternal, newborn, and 
child health (SDG 3) have well-documented associations with increased 
schooling» ”. In 2016, after years of deprioritization, aid to education 
reached its highest level since 2002". Despite this shift, only 22% of aid 
to basic education—defined as primary and lower-secondary—went to 
low-income countries in 2016 compared to 36% in 2002”. This reflects 
a persistent pattern in which the distribution of aid does not align with 
the greatest need, even at the national level. Beyond international aid, 
domestic policy is also a crucial tool for expanding access to educa- 
tion, especially at higher levels. However, policy-makers often do not 
have access to a rigorous evidence base at a subnational level. This 
analysis presents the subnational distribution of education to sup- 
port the growing evidence base of precision public health data, which 
shows widespread disparity of health outcomes as well as their social 
determinants. 


Mapping education across gender 


Despite widespread improvement in educational attainment since 
2000, gender disparity persists in 2017 in many regions. Figure 1 illus- 
trates the mean number of years of education and the proportion of 
individuals with no primary school attainment for men and women of 
reproductive age (15-49 years) in 2017. The average educational attain- 
mentis very low across much of the Sahel region of sub-Saharan Africa, 
consistent with previously published data”. In 2017, there was a large 
gender disparity in many regions, with men attaining higher average 


education across central and western sub-Saharan Africa and South 
Asia. Considerable variation remains between the highest- and lowest- 
performing administrative units within countries in 2017. For Ugandain 
2017, this indicator ranged from1.9 years of education (95% uncertainty 
interval, 0.8-3.0 years) in rural Kotido to 11.1 years (10.1-12 years) in 
Kampala, the capital city. Figure 1b, d displays the proportion of men 
and women aged 15-49 years who have not completed primary school. 
By considering the variation within populations in different locations, 
these maps help to identify areas with large populations in the vul- 
nerable lower end of the attainment distribution. We estimated large 
improvements in the proportions of individuals who have completed 
primary school in Mexico and China. However, across much of the 
world women in this age group failed to complete primary school ata 
much higher rate than their male counterparts. 

Despite continued lack of gender parity in education among the 
reproductive age group, vast progress towards parity has been made 
among the 20-24 age group. Extended Data Fig. 2 further examines 
gender parity in 2000 and 2017. This figure highlights two additional 
advantages of our analytic framework. First, we examined a younger 
group aged 20-24 years. Although education in this group is less 
directly relevant to maternal, newborn, and child health than educa- 
tionin the full window of reproductive age, these estimates allowed us 
to capture how the landscape of education has shifted over time (that 
is, across successive cohorts) and is therefore more likely to pick up 
improvements to access and retention in education systems that have 
been made since 2000. Second, we illustrate the probability that this 
estimated ratio is credibly different from 1 (parity between sexes) given 
the full uncertainty in our data and model. In 2000, we estimated that 
men completed schooling at a higher rate than women across much 


*A list of participants and their affiliations appears in the online version of the paper. 
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Fig. 1| Average educational attainment and proportion of individuals with 
nocompleted primary education at the first administrative level and 
absolute difference between women and men aged 15-49 years. a-d, Mean 


of the world, particularly for primary school education (that is, the 
probability that the parity ratio is greater than 1 was over 95%). This 
was true in most countries for both primary and secondary completion 
rates, but especially so in Burundi, Angola, Uganda, and Afghanistan 


educational attainment for women (a) and men (c) and the proportion of 
individuals with no primary school education for women (b) and men (d) aged 
15-49 years in 2017. Maps were produced using ArcGIS Desktop 10.6. 


(Extended Data Fig. 2a, c). By 2017, many countries moved significantly 
towards parity in both secondary and primary completion rates with 
the exception of large regions within central and western sub-Saharan 
Africa (Extended Data Fig. 2b, d). 
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Fig. 2| National progress in secondary attainment rates for women aged 20- 
24 years compared with the national index of dissimilarity in 2017. a, Change 
in secondary attainment rates for women age 20-24 years between 2000 and 
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2017 compared with the national index of dissimilarity in 2017 (simple linear 
regression lines are included). b, Map of the national index of dissimilarity in 
2017. Maps were produced using ArcGIS Desktop 10.6. 
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Fig. 3 | Attainment rates and contributions to national change in secondary 
rates for women aged 20-24 years in India and Nigeria, 2000-2017. 

a, b, Attainment rates for women aged 20-24 years in 2000 (a) and 2017 (b) at 
the second administrative level in India. c, Additive contributions of changesin 
the attainment rates at the second administrative level to change in the rate at 
the national level between 2000 and 2017 in India. d, e, Attainment rates for 


Inequalities within and between countries 


The subnational estimates of attainment presented here enable a closer 
examination of within-country inequality and associated trends over 
time. Figure 2 plots the national change in secondary attainment rates 
for women aged 20-24 years with the index of dissimilarity across sec- 
ond administrative-level units in 2017. The index of dissimilarity is an 
intuitive measure of geographical inequality that can be interpreted 
as the percentage of women with secondary attainment that would 
have to move in order to equalize secondary rates across all subna- 
tional districts. We estimated that countries that experienced more 
national progress over the period tended to be more spatially equal in 
2017. However, the top-right quadrant of the graph highlights several 
countries that experienced substantial national progress yet remain 
some of the most geographically unequal countries today. 

We further examined national progress between 2000 and 2017 in two 
such countries, India and Nigeria, where rates of secondary attainment 
increased from 10.9% (8.5-12.5%) to 37.2% (33.6-41.1%) and from 11.5% 
(6.2-18.3%) to 45.0% (37.0-52.5%), respectively (Fig. 3). The geographical 
distribution between two cohorts—women aged 20-24 years in 2000 
and 2017—was analysed by examining all proportions simultaneously 
(Fig. 3a, b). We estimate that there has been a massive shift towards 
primary and secondary completion coupled with greater geographical 
variability in completion rates (that is, spread of the dots that represent 
subnational units inthe legend). The majority of the 2017 cohort living 


c 2010-2017 


Contribution 


Primary >0.0020 


100 
0.0015 
0.0010 


0.0005 


ie) 


2017 f 2010-2017 


Contribution 
>0.0020 


0.0015 
0.0010 


fo} 

0S 
S 
S 


0.0005 
S 


ire} iS 


(e) 


women aged 20-24 years in 2000 and 2017 at the second administrative level in 
Nigeria. f, Additive contributions of changes in the attainment rates at the 
second administrative level to change in the rate at the national level between 
2000 and 2017 in Nigeria. On all ternary maps, the ‘Zero’ category includes all 
individuals with either no schooling or some primary schooling without 
completion. Maps were produced using ArcGIS Desktop 10.6. 


in the northwest and northeast of India never completed secondary 
school. Urban centres in the south, such as Bangalore and Mumbai, 
have seen considerable progress compared with more rural regions. In 
Nigeria, we estimate substantial national improvement; however, the 
country remained one of the most spatially unequal in 2017 (Fig. 3d, e). 
The more-urban south, particularly around Lagos, experienced much 
faster progress than the more-rural north. The implications of the popu- 
lation distribution were explored by decomposing the improvement in 
the national rate of secondary completion since 2000 for each country 
into the additive contributions of rate changes at the second adminis- 
trative level (Fig. 3c, f). This demonstrates that national progress was 
largely driven by improvements in populous urban regions (particularly 
Maharashtra, India, and Lagos, Nigeria), underscoring the importance 
of howsubnational progress (or lack thereof) contributes differentially 
to narratives surrounding national change. 


Discussion and limitations 

We have built on previous modelling efforts that focused on the geo- 
graphical distribution of average education” by extending our estima- 
tion to the distribution of attainment, highlighting not only average 
attainment but also the proportions of individuals who completed 
key levels of schooling that are central to policy efforts. As we dem- 
onstrate, throughout much of the world women lag behind their male 
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counterparts, and there is significant heterogeneity across subnational 
regions. Countries such as South Africa, Peru, and Colombia have seen 
tremendous improvement since 2000 in the proportion of the young 
adult population who have completed secondary school. As this trend 
continues, it will be important to focus not only on attainment but also 
on quality of education. However, many young women across the world 
still faced obstacles to attaining even a basic level of education in 2017 
(Extended Data Fig. 3). This represents a missed opportunity for the 
global health community to focus on a well-studied determinant of 
maternal, newborn, and child health. Even with only marginal returns 
to health in the short term, studies suggest that, on average, commu- 
nities will also see increased human capital, social mobility, and less 
engagement in child marriage or early childbearing”?°”. 

Children and adolescents do not complete formal schooling for 
many reasons. Many factors differentially affect girls, suchas cost, late 
or no school enrolment, forced withdrawal of married adolescents, 
and the social influence of family members concerning the traditional 
roles of girls and women*”°”?”3, A critical step is acknowledging that 
commercialization in the area of education typically leads to higher 
inequity. Treating public education as a societal good by increasing 
access, particularly in underserved rural communities, reduces inequal- 
ity. Identifying areas that are stagnating or worsening, particularly in 
the realm of basic education for young women across the world, is an 
important first step to targeted, long-term reform efforts that will ulti- 
mately have widespread benefits for equity in health and development. 

Many recent international calls to improve the social determinants of 
health have stated that measurement of inequity within countries is criti- 
calto understanding and tracking the problem, noting that geography is 
an increasingly important dimension of inequity™ *°. Where people are 
born greatly determines their life chances, and continuing to consider 
development and human capital formation on a national level is insuf- 
ficient”*. The goal of this analysis is to identify local areas that may have 
experienced negligible improvements, but further rigorous research 
is required to contextualize these patterns within the unique mix of 
structural obstacles that each community faces. There are many indirect 
costs for attending school and each disadvantaged area that we identify 
in our analysis may experience them in different ways. These include 
the demand for children to work, the opportunity or monetary costs 
of attending school, distance to school, lack of compulsory education 
requirements, high fees for attendance, political instability, and many 
other forces. Overcoming these obstacles to improve educational attain- 
ment alone will not necessarily result ina more-educated and healthy 
population for each country as highly educated individuals may be more 
likely to emigrate, resulting in ‘brain drain’. This is especially true for coun- 
tries that have been economically crippled over the past two decades 
and may lack the economic capacity to absorb a more highly educated 
labour force. Opening access to education will need to be coupled with 
economic reforms, both internationally and domestically, if countries 
are to fully experience dividends in human capital and health. 

Over the next decade of the SDG agenda, it will be important to 
maintain the progress that has been made to reprioritise investment 
in education systems. There remains an alarming lack of distributional 
accountability in aid, especially to basic education, for which most 
funding is not going to the countries that need it most”. Connections 
between educational attainment and health offer promising opportuni- 
ties for co-financing initiatives. For example, USAID recently invested 
US$90 million in HIV funding to the construction of secondary schools 
in sub-Saharan Africa. Global health leaders have noted the need to 
invest in precise data systems and eliminate data gaps to effectively 
target resources, develop equitable policy, and track accountability’. 
Our analysis provides a robust evidence base for such decision-making 
and advocacy. Decades of research on the effect of basic education 
on maternal, newborn, and child health positions this issue squarely 
in the purview of the global health agenda. It is crucial for the global 
health community to invest in long-term, sustainable improvement in 
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the underlying distribution of human capital, as this is the only way to 
truly influence health equity across generations. 
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Methods 


Overview 

Using a Bayesian model-based geostatistical framework and synthe- 
sizing geolocated data from 528 household and census datasets, this 
analysis provides subnational estimates of mean numbers years of 
education and the proportion of the population who attained key levels 
of education for women of reproductive age (15-49 years), women aged 
20-24 years, and equivalent male age bins between 2000 and 2017 in 
105 countries across all low- and middle-income countries (LMICs). 
Countries were selected for inclusion in this analysis using the socio- 
demographic index (SDI) published in the Global Burden of Disease 
(GBD) study”’. The SDI is a measure of development that combines 
education, fertility, and poverty. Countries in the middle, lower-middle, 
or low SDI quintiles were included, with several exceptions. Albania, 
Bosnia, and Moldova were excluded despite middle SDI status due to 
geographical discontinuity with other included countries and lack 
of available survey data. Libya, Malaysia, Panama, and Turkmenistan 
were included despite higher-middle SDI status to create better geo- 
graphical continuity. We did not analyse American Samoa, Federated 
States of Micronesia, Fiji, Kiribati, Marshall Islands, Samoa, Solomon 
Islands, or Tonga, where no available survey data could be sourced. 
Analytical steps are described below, and additional details can be 
found in the Supplementary Information. 


Data 

We compiled a database of survey and census datasets that contained 
geocoding of subnational administrative boundaries or GPS coordi- 
nates for sampled clusters. These included datasets from 528 sources 
(see Supplementary Table 2). These sources comprised at least one 
data source for all but two countries on our list of LMICs: Western 
Sahara and French Guiana. We chose to exclude these two countries 
from our analysis; 42 of 105 included countries have only subnational 
administrative level data. We extracted demographic, education, and 
sample design variables. The coding of educational attainment varies 
across survey families. In some surveys, the precise number of years of 
attainment is not provided, with attainment instead aggregated into 
categories such as ‘primary completion’ or ‘secondary completion’. 
In such cases, individuals who report ‘primary completion’ may have 
gone ontocomplete some portion of secondary education, but these 
additional years of education are not captured in the underlying data- 
set. Previous efforts to examine trends in mean years of education have 
either assumed that no additional years of education were completed 
(that is, primary education only) or have used the midpoint between 
primary and secondary education as a proxy”’. Trends in the single- 
year data, however, demonstrate that such assumptions introduce 
bias in the estimation of attainment trends over time and space, as 
differences in actual drop-out patterns or binning schema can lead to 
biased mean estimates”. 

For this analysis, we used a recently developed method that selects 
atraining subset of similar surveys across time and space to estimate 
the unobserved single-year distribution of binned datasets”’. In 
comprehensive tests of cross-validation that leveraged data for 
which the single-year distributions are observed, this algorithmic 
approach significantly reduces bias in summary statistics estimated 
from datasets with binned coding schemes compared to alternatives 
such as the standard-duration method”. The years in all coding 
schemes were mapped to the country- and year-specific references 
in the UNESCO International Standard Classification of Education 
(ISCED) for comparability*’. We used a top coding of 18 years on all 
data; this is acommon threshold in many surveys that have a cap 
and it is reasonable to assume that the importance of education 
for health outcomes (and other related SDGs) greatly diminishes 
after what is the equivalent of 2 to 3 years of graduate education 
in most systems. 


Data were aggregated to mean years of education attained and the 
proportions achieving key levels of education. The levels chosen were 
proportion with zero years, proportion with less than primary school 
(1-5 years of education), proportion with at least primary school 
(6-11 years of education), and proportion achieving secondary school 
or higher (12 or more years of education). A subset of the data fora 
smaller age bin (20-24 years) was also examined to more closely track 
temporal shifts. Equivalent age bins were aggregated for both women 
and men to examine disparities in mean years of attainment by sex. 
Where GPS coordinates were available, data were aggregated toa 
specific latitude and longitude assuming a simple-random sample, 
as the cluster is the primary sampling unit for the stratified design 
survey families, such as the Demographic and Health Survey (DHS) 
and Multiple Indicator Cluster Survey (MICS). Where only geographi- 
cal information was available at the level of administrative units, data 
were aggregated with appropriate weighting according to their sample 
design. Design effects were estimated using a package for analysing 
complex survey datain R™. 


Spatial covariates 

To leverage strength from locations with observations to the entire 
spatiotemporal domain, we compiled several 5 x 5-km’ raster layers 
of possible socioeconomic and environmental correlates of education 
(Supplementary Table 5 and Supplementary Fig. 6). Acquisition of 
temporally dynamic datasets, where possible, was prioritized to best 
match our observations and thus predict the changing dynamics of 
educational attainment. We included nine covariates indexed at the 
5x 5-km? level: access to roads, nighttime lights”, population", growing 
season, aridity, elevation, urbanicity”, irrigation, and year" (tv, time- 
varying covariates). More details, including plots of all covariates, can 
be found in the Supplementary Information. 

Our primary goal is to provide educational attainment predictions 
across LMICs at a high (local) resolution, and our methods provide 
the best out-of-sample predictive performance at the expense of 
inferential understanding. To select covariates and capture possible 
nonlinear effects and complex interactions between them, an ensem- 
ble covariate modelling method was implemented. For each region, 
three submodels were fitted to our outcomes using all of our covariate 
data: generalized additive models, boosted regression trees, and lasso 
regression. Each submodel was fit using fivefold cross-validation to 
avoid overfitting and the out-of-sample predictions from across the 
five folds were compiled into a single comprehensive set of predictions 
from that model. Additionally, the same submodels were also run using 
100% of the data and a full set of in-sample predictions were created. The 
five sets of out-of-sample submodel predictions were fed into the full 
geostatistical model as predictors when performing the model fit. The 
in-sample predictions from the submodels were used as the covariates 
when generating predictions using the fitted full geostatistical model. 
This methodology maximizes out-of-sample predictive performance 
at the expense of the ability to provide statistical inference on the rela- 
tionships between the predictors and the outcome. A recent study has 
shown that this ensemble approach can improve predictive validity by 
up to 25% over anindividual model”. More details on this approach can 
be found in the Supplementary Information. 

The primary goal of using the stacking procedure in our analyses was 
to maximize the predictive power of the raster covariates by capturing 
the nonlinear effects and complex interactions between covariates to 
optimize the model performance. It has previously been suggested” 
that the primary purpose of the submodel predictions is to improve 
the mean function of the Gaussian process. Although we have deter- 
mined a way to include the uncertainty from two of our submodels 
(lasso regression and generalized additive models (GAM)), we have not 
determined a way to include uncertainty from the boosted regression 
tree (BRT) submodel into our final estimates. Whereas GAM and lasso 
regression seek to fit a single model that best describes the relationship 


between response variable and someset of predictors, BRT method fits 
a large number of relatively simple models for which the predictions 
are then combined to give robust estimates of the response. Although 
this feature of the BRT model makes it a powerful tool for analysing 
complex data, quantifying the relative uncertainty contributed by each 
simple model as well as uncertainty from the complex interactions of 
the predictor variables is challenging”. It is worth noting, however, 
that our out-of-sample validation indicates that the 95% coverage is 
fairly accurate (for example, closely ranges around 95%) as shown in 
the figures and table of Supplementary Information section 4.3.2. This 
indicates that we are not misrepresenting the uncertainty in our final 
estimates. 


Analysis 
Geostatistical model. Gaussian and binomial data are modelled within 
a Bayesian hierarchical modelling framework using a spatially and 
temporally explicit hierarchical generalized linear regression model 
to fit the mean number years of education attainment and the propor- 
tion of the population who achieved key bins of school in 14 regions 
across all LMICs as defined in the GBD study (Extended Data Fig. 1). This 
means we fit 14 independent models for each indicator (for example, 
the proportion of women with zero years of schooling). GBD study 
design sought to create regions on the basis of three primary crite- 
ria: epidemiological homogeneity, sociodemographic similarity, and 
geographical contiguity”. Fitting our models by these regions has the 
advantage of allowing for some non-stationarity and non-isotropy in 
the spatial error term, compared to if we modelled one spatiotemporal 
random-effect structure over the entire modelling region of all LMICs. 
For each Gaussian indicator, we modelled the mean number of years 
of attainment in each survey cluster, d. Survey clusters are precisely 
located by their GPS coordinates and year of observation, which we map 
to a spatial raster location i at time t. We model the mean number of 
years of attainment as Gaussian data given fixed precision tand ascaling 
parameter s, (defined by the sample size in the observed cluster). As 
we may have observed multiple data clusters within a given location 
iat time ¢, we refer to the mean attainment, y, within a given cluster d 
by its indexed location i, and time tas pia «a. 
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For each binomial indicator, we modelled the number of individu- 
als at a given attainment level in each survey cluster, d. We observed 
the number of individuals reporting a given attainment level as bino- 
mial count data C, among an observed sample size N,. AS we may have 
observed multiple data clusters within a given location i at time t, we 
refer to the probability of attaining that level, p, within a given cluster 
dby its indexed location iand time tas pia,ca)- 
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We used a continuation-ratio modelling approach to account for 
the ordinal data structure of the binomial indicators®. To do this, the 
proportion of the population with zero years of education was modelled 
using a binomial model. The proportion with less than primary educa- 
tion was modelled as those with less than primary education of those 
that have more than zero years of education. The same method followed 
for the proportion of population completing primary education. The 


proportion achieving secondary school or higher was estimated as the 
complement of the sum of the three binomial models. 

The remaining parameter specification was consistent between all 
indicators in both binomial and Gaussian models: 
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For indices d, i, and t, *(index) is the value of * at that index. The prob- 
abilities p,;, represent both the annual proportions at the space-time 
location and the probability that an individual had that level of attain- 
ment given that they lived at that particular location. The annual prob- 
ability p;,of each indicator (or y,,for the mean indicators) was modelled 
as a linear combination of the three submodels (GAM, BRT, and lasso 
regression), rasterized covariate values X, ,,a correlated spatiotempo- 
ral error term Z,,,country random effects €,,,,;, with one unstructured 
country random effect fit for each country in the modelling region and 
all sharing acommon variance parameter, y’, and an independent nug- 
get effect e, ,with variance parameter o*. Coefficients f, in the three 
submodels hf = 1, 2,3 represent their respective predictive weighting 
in the mean logit link, while the joint error term Z,,accounts for resid- 
ual spatiotemporal autocorrelation between individual data points 
that remains after accounting for the predictive effect of the submodel 
covariates, the country-level random effect €,,,;, and the nugget inde- 
pendenterror term, e€; ,. The purpose of the country-level random 
effect is to capture spatially unstructured, unobserved country-specific 
variables, as there are often sharp discontinuities in educational attain- 
ment between adjacent countries due to systematic differences in 
governance, infrastructure, and social policies. 

The residuals Z,,are modelled as a three-dimensional Gaussian process 
(GP) in space-time centred at zero and with a covariance matrix con- 
structed froma Kronecker product of spatial and temporal covariance 
kernels. The spatial covariance 2*"** is modelled using an isotropic and 
stationary Matérn function*®, and temporal covariance x""’ as an annual 
autoregressive (AR1) function over the 18 years represented in the model. 
In the stationary Matérn function, [is the Gamma function, K, is the 
modified Bessel function of order v> 0, x > Ois ascaling parameter, 
D denotes the Euclidean distance, and w” is the marginal variance. The 
scaling parameter, x, is defined to be x= /8v/6 where 6 is a range 
parameter (which is about the distance for which the covariance function 
approaches 0.1) and visa scaling constant, which is set to 2 rather than 
fit from the data®”**. This parameter is difficult to reliably fit, as docu- 
mented by many other analyses*”***° that set this parameter to 2. 
The number of rows and the number of columns of the spatial Matérn 
covariance matrix are equal to the number of spatial mesh points fora 
given modelling region. Inthe ARI function, pis the autocorrelation func- 
tion (ACF), and kand/are points inthe time series where |k—j| defines the 
lag. Thenumber of rows andthe number of columns of the ARI covariance 
matrix are equal to the number of temporal mesh points (18). The number 
ofrowsand the number of columns of the space-time covariance matrix, 
rspace @ ym For a given modelling region are equal to: the number of 
spatial mesh points x the number of temporal mesh points. 
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This approach leveraged the residual correlation structure of the data 
to more accurately predict estimates for locations with no data, while 
also propagating the dependence in the data through to uncertainty 
estimates*. The posterior distributions were fit using computationally 
efficient and accurate approximations in R-integrated nested Laplace 
approximation (INLA) with the stochastic partial differential equations 
(SPDE) approximation to the Gaussian process residuals using R project 
version 3.5.1 *. The SPDE approach using INLA has been demonstrated 
elsewhere, including the estimation of health indicators, particulate air 
matter, and population age structure” **”, Uncertainty intervals were 
generated from 1,000 draws (that is, statistically plausible candidate 
maps)*’ created from the posterior-estimated distributions of mod- 
elled parameters. Additional details regarding model and estimation 
processes can be found in the Supplementary Information. 

Totransform grid cell-level estimates into a range of information that 
is useful to a wide constituency of potential users, these estimates were 
aggregated from the 1,000 candidate maps up to district, provincial, 
and national levels using 5 x 5-km? population data”. This aggregation 
also enabled the calibration of estimates to national GBD estimates for 
2000-2017. This was achieved by calculating the ratio of the posterior 
mean national-level estimate from each candidate map draw in the 
analysis to the posterior mean national estimates from GBD, and then 
multiplying each cell in the posterior sample by this ratio. National- 
level estimates from this analysis with GBD estimates can be found in 
Supplementary Table 44. 

To illustrate how subnational progress has contributed differentially 
to national progress (Fig. 3), we decomposed the improvement in the 
national rate of secondary completion since 2000 for each country 
into the additive contributions of rate changes at the second admin- 
istrative level, where Cis the national secondary rate change, Nis the 
total number of second-level administrative units, c;is the population 
proportion in administrative unit i, and r,is the rate of secondary attain- 
ment in administrative unit i. 


N 
C= > (Ci 2017F, 2017) ~ (C;,2000%,2000) 
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Although the model can predict at all locations covered by avail- 
able raster covariates, all final model outputs for which land cover was 
classified as ‘barren or sparsely vegetated’ were masked, on the basis 
of the most recently available Moderate Resolution Imaging Spectro- 
radiometer (MODIS) satellite data (2013), as well as areas in which the 
total population density was less than 10 individuals per 1 x 1-km? pixel 
in 2015°°. This step has led to improved understanding when commu- 
nicating with data specialists and policy-makers. 


Model validation. Models were validated using source-stratified five- 
fold cross-validation. To offer a more stringent analysis by respecting 
some of the source and spatial correlation in the data, holdout sets 
were created by combining sets of data sources (for example, entire 
survey- or census-years). Model performance was summarized by the 
bias (mean error), total variance (root-mean-square error) and 95% 
data coverage within prediction intervals, and the correlation between 
observed data and predictions. All validation metrics were calculated 
onthe predictions from the fivefold cross-validation. Where possible, 
estimates from these models were compared against other existing esti- 
mates. Furthermore, measures of spatial and temporal autocorrelation 
pre- and post-modelling were examined to verify correct recognition, 
fitting, and accounting for the complex spatiotemporal correlation 
structure in the data. All validation procedures and corresponding 
results are provided in the Supplementary Information. 


Limitations. Our analysis is not without several important limitations. 
First, almost all data collection tools conflate gender and sex and we 
therefore do not capture the full distribution of sex or gender separately 


in our data. We refer throughout to the measurement of ‘gender (in) 
equality’, following the usage in SDG5. Second, it is extremely difficult to 
quantify quality of education on this scale ina comparable way. Quality 
is ultimately a large part of the SDG agenda and of utmost importance 
to achieving equity in opportunity for social mobility. However, many 
studies across diverse low- and middle-income settings have linked at- 
tainment, even very low levels, to measurable improvement in maternal 
and child health”. As our analysis highlights with the proportional indi- 
cators, there are still many subnational regions across the world where 
large proportions do not complete primary school. A third limitation is 
that we are unable to measure or account for migration. A concept note 
released from the forthcoming Global Education Monitoring Report 
2019 focuses on how migration and displacement affects schooling”. 
Our estimates of the modelled outcome, educational attainment for 
a particular space-time-age-sex, are demonstrated to be statistically 
unbiased (Supplementary Information section 4.3); however, interpre- 
tation of any change in attainment as a change in the underlying educa- 
tion system could potentially be biased by the effects of migration. It 
is possible that geographical disparities reflect changes in population 
composition rather than changes in the underlying infrastructure or 
education system. Pathways for this change are complex and may be vol- 
untary. Those who manage to receive an education in alow-attainment 
area may have an increased ability to migrate and choose to doso. This 
change may also be involuntary, particularly in politically unstable 
areas where displacement may make geographical changes over time 
difficult to estimate. A shifting population composition is a general 
limitation of many longitudinal ecological analyses, but the spatially 
granular nature of the analyses used here may be more sensitive to the 
effects of mobile populations. 

Our analysis is purely predictive but draws heavily in its motivation 
froma rich history of literature on the role of education in reducing 
maternal mortality, improving child health, and increasing human 
capital. Studies have also demonstrated complex relationships between 
increased education and a myriad of positive health outcomes, such 
as HIV risk reductions and spillover effects to other household mem- 
bers>**, The vast majority of these studies are associational and recent 
attempts at causal analyses have provided more-mixed evidence. 
Although causal analyses of education are very difficult and often rely 
on situational quasi-experiments, associational analyses using the 
most comprehensive datasets demonstrate consistent support for the 
connection between education and health”. Looking towards future 
analyses, it will be important to study patterns of change in these data 
and how they overlap with distributions of health. Lastly, our estimates 
cannot be seen as a replacement for proper data collection systems, 
especially for tracking contemporaneous change. Our analysis of uncer- 
tainty at a high-resolution may be used to inform investment in more 
robust data systems and collection efforts, especially if the ultimate 
goal is to measure and track progress in the quality of schooling. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


The findings of this study are supported by data that are available in 
public online repositories, data that are publicly available upon request 
from the data provider, and data that are not publicly available owing to 
restrictions by the data provider, which were used under license for the 
current study, but may be available from the authors upon reasonable 
request and permission of the data provider. A detailed table of data 
sources and availability can be found in Supplementary Table 2. Interac- 
tive visualization tools are available at https://vizhub.healthdata.org/ 
Ibd/education. All maps presented in this study are generated by the 
authors; no permissions are required for publication. Administrative 


boundaries were retrieved from the Global Administrative Unit Layers 
(GAUL) dataset, implemented by FAO within the CountrySTAT and 
Agricultural Market Information System (AMIS) projects®®. Land cover 
was retrieved from the online Data Pool, courtesy of the NASA EOSDIS 
Land Processes Distributed Active Archive Center (LP DAAC), USGS/ 
Earth Resources Observation and Science (EROS) Center, Sioux Falls, 
South Dakota®. Lakes were retrieved from the Global Lakes and Wet- 
lands Database (GLWD), courtesy of the World Wildlife Fund and the 
Center for Environmental Systems Research, University of Kassel>?°. 
Populations were retrieved from WorldPop**”. All maps were produced 
using ArcGIS Desktop 10.6. 


Code availability 


Our study follows the Guidelines for Accurate and Transparent Health 
Estimates Reporting (GATHER). All code used for these analyses is avail- 
able online at http://ghdx.healthdata.org/record/ihme-data/Imic- 
education-geospatial-estimates-2000-2017, and at http://github.com/ 
ihmeuw/Ibd/tree/edu-Imic-2019. 


27. GBD 2016 Causes of Death Collaborators. Global, regional, and national age-sex specific 
mortality for 264 causes of death, 1980-2016: a systematic analysis for the Global Burden 
of Disease Study 2016. Lancet 390, 1151-1210 (2017). 

28. Barro, R. J. & Lee, J. W. A new data set of educational attainment in the world, 1950-2010. 
J. Dev. Econ. 104, 184-198 (2013). 

29. Friedman, J., Graetz, N. & Gakidou, E. Improving the estimation of educational attainment: 
new methods for assessing average years of schooling from binned data. PLoS ONE 13, 
e€0208019 (2018). 

30. UNESCO. ISCED Mappings (UNESCO, 2016); http://uis.unesco.org/en/isced-mappings 

31. Lumley, T. survey: analysis of complex survey samples. R package v.3.36 
https://cran.r-project.org/web/packages/survey.pdf (2019). 

32. Bhatt, S. et al. Improved prediction accuracy for disease risk mapping using Gaussian 
process stacked generalization. J. R. Soc. Interface 14, 20170520 (2017). 

33. Elith, J., Leathwick, J. R. & Hastie, T. A working guide to boosted regression trees. J. Anim. 
Ecol. 77, 802-813 (2008). 

34. Leathwick, J., Elith, J., Francis, M., Hastie, T. & Taylor, P. Variation in demersal fish species 
richness in the oceans surrounding New Zealand: an analysis using boosted regression 
trees. Mar. Ecol. Prog. Ser. 321, 267-281 (2006). 

35. Hosmer, D. W., & Lemeshow, S. in Applied Logistic Regression 289-305 (Wiley, 2013). 

36. Stein, M.L. Interpolation of Spatial Data (Springer, 1999). 

37. Lindgren, F. & Rue, H. Bayesian spatial modelling with R-INLA. J. Stat. Softw. 63, 1-25 
(2015). 

38. Lindgren, F., Rue, H. & Lindstr6m, J. An explicit link between Gaussian fields and Gaussian 
Markov random fields: the stochastic partial differential equation approach. J. R. Stat. 
Soc. B73, 423-498 (2011). 

39. Rozanoyv, Y. A. in Markov Random Fields 55-102 (Springer, 1982). 

40. Whittle, P. On stationary processes in the plane. Biometrika 41, 434-449 (1954). 

41. Diggle, P. & Ribeiro, P. J. Model-based Geostatistics (Springer, 2007). 

42. Rue, H., Martino, S. & Chopin, N. Approximate Bayesian inference for latent Gaussian 
models by using integrated nested Laplace approximations. J. R. Stat. Soc. B 71, 319-392 
(2009). 

43. Rue, H. et al. Bayesian Computing with INLA (2014); http://www.r-inla.org/ 

44. Blangiardo, M., Cameletti, M., Baio, G. & Rue, H. Spatial and spatio-temporal models with 
R-INLA. Spat. Spatiotemporal Epidemiol. 7, 39-55 (2013). 

45. Krainski, E. T., Lindgren, F., Simpson, D. & Rue, H. The R-INLA Tutorial on SPDE Models 
(2017); https://inla.r-inla-download.org/r-inla.org/tutorials/spde/spde-tutorial. pdf 


46. Cameletti, M., Lindgren, F., Simpson, D. & Rue, H. Spatio-temporal modeling of 
particulate matter concentration through the SPDE approach. Adv. Stat. Anal. 97, 109-131 
(2013). 

47. Alegana, V. A. et al. Fine resolution mapping of population age-structures for health and 
development applications. J. R. Soc. Interface 12, 20150073 (2015). 

48. Patil, A.P., Gething, P. W., Piel, F. B. & Hay, S. |. Bayesian geostatistics in health 
cartography: the perspective of malaria. Trends Parasitol. 27, 246-253 (2011). 

49. Tatem, A. J. WorldPop, open data for spatial demography. Sci. Data 4, 170004 
(2017). 

50. Friedl, M. et al. MCD12Q2 vOO6. MODIS/Terra+tAqua Land Cover Type Yearly L3 Global 
500 m SIN Grid. (NASA EOSDIS Land Processes DAAC, 2019); https://doi.org/10.5067/ 
MODIS/MCD12Q1.006 

51. Global Education Monitoring Report. Concept Note for the 2019 Global Education 
Monitoring Report on Education and Migration (2017). 

52. Behrman, J. A. The effect of increased primary schooling on adult women’s HIV status in 
Malawi and Uganda: universal primary education as a natural experiment. Soc. Sci. Med. 
127, 108-115 (2015). 

53. De Neve, J.-W., Fink, G., Subramanian, S. V., Moyo, S. & Bor, J. Length of secondary 
schooling and risk of HIV infection in Botswana: evidence from a natural experiment. 
Lancet Glob. Health 3, e470-e477 (2015). 

54. McCrary, J. & Royer, H. The effect of female education on fertility and infant health: 
evidence from school entry policies using exact date of birth. Am. Econ. Rev. 101, 158-195 
(2011). 

55. Karlsson, O., De Neve, J.-W. & Subramanian, S. V. Weakening association of parental 

education: analysis of child health outcomes in 43 low- and middle-income countries. 

Int. J. Epidemiol. 48, 83-97 (2019). 

56. De Neve, J.-W. & Fink, G. Children’s education and parental old age survival — 

quasi-experimental evidence on the intergenerational effects of human capital 

investment. J. Health Econ. 58, 76-89 (2018). 

57. Pamuk, E.R., Fuchs, R. & Lutz, W. Comparing relative effects of education and economic 

resources on infant mortality in developing countries. Popul. Dev. Rev. 37, 637-664 

(2011). 

58. FAO-UN. The Global Administrative Unit Layers (GAUL) (2015); http://www.fao.org/ 

geonetwork/srv/en/metadata.show?id=12691 

59. World Wildlife Fund. Global Lakes and Wetlands Database, Level 3 (2004); https://www. 

worldwildlife.org/pages/global-lakes-and-wetlands-database 

60. Lehner, B. & Dall, P. Development and validation of a global database of lakes, reservoirs 
and wetlands. J. Hydrol. 296, 1-22 (2004). 

61. World Pop. Data Types (accessed: 7th July 2017); https://www.worldpop.org/project/list 

62. Channan, S., Collins, K. & Emanuel, W. Global mosaics of the standard MODIS land cover 
type data. University of Maryland and the Pacific Northwest National Laboratory, College 
Park, Maryland, USA. (2014) 


Acknowledgements This work was primarily supported by grant OPP1132415 from the Bill & 
Melinda Gates Foundation. N.G. is the recipient of a training grant from the National Institute of 
Child Health and Human Development (T32 HD-007242-36A1). 


Author contributions S.|.H. and N.G. conceived and planned the study. K.W. and J.H. extracted, 
processed, and geo-positioned the data. L.W. and N.G. carried out the statistical analyses. All 
authors provided intellectual inputs into aspects of this study. N.G., L.W., J.H., and L.E. 
prepared figures and tables. N.G. wrote the manuscript with assistance by S.B.M., and all 
authors contributed to subsequent revisions. 


Competing interests The authors declare no competing interests. 


Additional information 

Supplementary information is available for this paper at https://doi.org/10.1038/s41586-019- 
1872-1. 

Correspondence and requests for materials should be addressed to S.I.H. 

Peer review information Nature thanks M. Dolores Ugarte and the other, anonymous, 
reviewer(s) for their contribution to the peer review of this work. 

Reprints and permissions information is available at http://www.nature.com/reprints. 


Article 


~ 
MBG Modelling Regions 
GEE xscecn south America (ansa) 


GEE) central America and the Caribbean (caca) 


| | Central Asia (stan) 
(EG contrat sub-Saharan Africa (essa) 
[EBB E01 asic (eco) 
: [EEE] cestern sub-Sarahan Africa (essa) 
] Middle East (mide) 
North Africa (noaf) 


BE ccna fc80} 
BB) seth as fn 
BEB scutes 


[EEG southern sub-Saharan Africa (sss) 


[EJ tropical south America (11a) 


= Western Sub-Saharan Africa (wssa) 


[EE] stage 3 
Extended Data Fig. 1| Modelling regions based on geographical and SDI Oceania, Southeast Asia, South Asia, southern sub-Saharan Africa, Central 
regions from the GBD. Modelling regions were defined as follows. Andean Asia, Tropical South America, and western sub-Saharan Africa. Regions in grey 
South America, Central America and the Caribbean, central sub-Saharan were not included in our models due to high-middle and high SDIs””. The map 


Africa, East Asia, eastern sub-Saharan Africa, Middle East, North Africa, was produced using ArcGIS Desktop 10.6. 


A 


aé 

- v @ 
Se) 
uf 


Probability Males 
- | have greater primary 
education (20-24) 


Probability Males 
have greater primary 
education (20-24) 
° 
goa 
§ 5 


1. | 


ee ae a Tey 
Secondary £e 


Sg ats 

oT , 
Ly 
a: - 0 , te 
Ess t 
2SB5 y 
fiz, fg 
a |_| cS 

oy 


Secondary AS 


ae 


Probability Males have 
greater secondary 
education (20-24) 


a 

A 
Pee 
ww 


Extended Data Fig. 2| Probability that the ratio of mento women aged 
20-24 years who attained primary and secondary education is >1in 2000 
and 2017. a—d, Probability that ratio is >1 (for example, men complete ata 


higher rate than women) for attaining primary education (a, b) and secondary 
education (c, d), aggregated to first administrative-level units in 2000 (a,c) and 
2017 (b, d). Maps were produced using ArcGIS Desktop 10.6. 


Article 


Mean years of educations 
(females, 20-24) 


Mean years of educatio 
(males, 20-24) 


Difference in mean 
years of education 


Extended Data Fig. 3 | Average educational attainment and proportion with education (f). Maps reflect administrative boundaries, land cover, lakes and 


no primary school at the first administrative level and absolute difference population; grey-coloured grid cells were classified as ‘barren or sparsely 
between women and menaged 20-24 years. a-d, Average educational vegetated’ and had fewer than ten people per 1x 1-km’ grid cell*”** ©", or were 
attainment for women (a) and men (c) and proportion with no primary school not included in these analyses. Interactive visualization tools are available at 
for women (b) and men (d) aged 20-24 years in 2017. e, f, The absolute https://vizhub.healthdata.org/Ibd/education. Maps were produced using 
difference in average educational attainment between men and women aged ArcGIS Desktop 10.6. 


20-24 years in 2017 (e) and proportion of individuals with no primary school 


nature research Corresponding author(s): Simon |.Hay and Nick Graetz 


Reporting Summary 


Nature Research wishes to improve the reproducibility of the work that we publish. This form provides structure for consistency and transparency 
in reporting. For further information on Nature Research policies, see Authors & Referees and the Editorial Policy Checklist. 


Statistical parameters 


When statistical analyses are reported, confirm that the following items are present in the relevant location (e.g. figure legend, table legend, main 
text, or Methods section). 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


An indication of whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistics including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) AND 
variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Clearly defined error bars 
State explicitly what error bars represent (e.g. SD, SE, Cl) 


Our web collection on statistics for biologists may be useful. 


Software and code 


Policy information about availability of computer code 


Data collection No primary data collection was carried out for this analysis. 
Data analysis This analysis was carried out using R version 3.5.0. The main geostatistical models were fit using R-INLA version 18.07.12. All code used 
for these analyses is publicly available online at http://ghdx.healthdata.org/. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers 
upon request. We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


The findings of this study are supported by data that are available in public online repositories, data that are publicly available upon request from the data provider, 
and data that are not publicly available due to restrictions by the data provider and which were used under license for the current study. A detailed table of data 


=) 
fed) 
a 
iS 
= 
a) 
= 
a) 
Za) 
ie) 
ied) 
= 
(i) 
=i 
= 
a) 
ae) 
fe) 
a. 
=) 
© 
Za) 
= 
3 
= 
ied) 
= 
~< 


sources and availability can be found in the Supplemental Information. 


Administrative boundaries were retrieved from the Global Administrative Unit Layers (GAUL) dataset, implemented by FAO within the CountrySTAT and Agricultural 
Market Information System (AMIS) projects. Land cover was retrieved from the online Data Pool, courtesy of the NASA EOSDIS Land Processes Distributed Active 
Archive Center (LP DAAC), USGS/Earth Resources Observation and Science (EROS) Center, Sioux Falls, South Dakota. Lakes were retrieved from the Global Lakes and 
Wetlands Database (GLWD), courtesy of the World Wildlife Fund and the Center for Environmental Systems Research, University of Kassel. Populations were 
retrieved from WorldPop. 


Field-specific reporting 


Please select the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


xX] Life sciences Behavioural & social sciences Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/authors/policies/ReportingSummary-flat.pdf 


=) 
ied) 
a 
iS 
= 
a) 
= 
a) 
Za) 
a) 
ied) 
= 
(i) 
=H 
= 
a) 
ae) 
fo) 
a. 
=) 
© 
Wn 
= 
3 
3 
ied) 
= 
~< 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size Sample size was calculated as the number of unique data source location-pairs with observations of years of educational attainment for males 
and females. This sample size is reported in the methods section as "We compiled a database of survey and census datasets that contained 
geocoding of subnational administrative boundaries or GPS coordinates for sampled clusters. These included datasets from 528 sources (see 
Supplementary Table 2). 


Data exclusions As described n the methods section of the main text (with greater detail in the Supplementary Information) we did not include data from 
Western Sahara or French Guiana, due to lack of availability. 42 of 105 included countries have only subnational administrative level data. 
Employing a Bayesian model-based geostatistical framework and synthesizing geolocated data from 517 household and census datasets, this 
analysis provides subnational estimates of mean years of education and proportion of the population attaining key levels of education for 
women of reproductive age (15-49), women age 20-24, and equivalent male age-bins between 2000-2017 in low- and middle-income 
countries. This includes 105 countries across all low- and middle-income countries. Countries were selected for inclusion in this analysis using 
the Socio-demographic Index (SDI) published in the GBD46. The SDI is a measure of development that combines education, fertility, and 
poverty. Countries in the Middle, Lower-Middle, or Low SDI quintiles were included, with several exceptions. Albania, Bosnia, and Moldova 
were excluded despite Middle SDI status due to geographic discontinuity with other included countries and lack of available survey data. 
Libya, Malaysia, Panama, and Turkmenistan were included despite Higher-Middle SDI status to create better geographic continuity. We do not 
estimate for American Samoa, Federated States of Micronesia, Fiji, Kiribati, Marshall Islands, Samoa, Solomon Islands, or Tonga, where no 
available survey data could be sourced. 


Replication This is an observational study using many years of survey and surveillance data and could be replicated. 
Randomization _ This analysis is an observational mapping study and there were no experimental groups. 


Blinding Blinding was not relevant to this study, as it was an observational study using survey and surveillance data. 


Reporting for specific materials, systems and methods 


Materials & experimental systems Methods 

n/a | Involved in the study n/a | Involved in the study 
Unique biological materials ChIP-seq 
Antibodies Flow cytometry 
Eukaryotic cell lines MRI-based neuroimaging 


Palaeontology 


Animals and other organisms 


Human research participants 


Article 


Internal state dynamics shape brainwide 
activity and foraging behaviour 


https://doi.org/10.1038/s41586-019-1858-z 


Received: 7 December 2018 


Joao C. Marques'”“, Meng Li’*“, Diane Schaak', Drew N. Robson'** & Jennifer M. Li'** 


Accepted: 18 November 2019 


Published online: 18 December 2019 


The brain has persistent internal states that can modulate every aspect of an animal’s 
mental experience! *. In complex tasks such as foraging, the internal state is 


dynamic? *. Caenorhabditis elegans alternate between local search and global 
dispersal°. Rodents and primates exhibit trade-offs between exploitation and 
exploration®’. However, fundamental questions remain about how persistent states 
are maintained in the brain, which upstream networks drive state transitions and how 
state-encoding neurons exert neuromodulatory effects on sensory perception and 
decision-making to govern appropriate behaviour. Here, using tracking microscopy 
to monitor whole-brain neuronal activity at cellular resolution in freely moving 
zebrafish larvae’, we show that zebrafish spontaneously alternate between two 
persistent internal states during foraging for live prey (Paramecia). Inthe exploitation 
state, the animal inhibits locomotion and promotes hunting, generating small, 
localized trajectories. In the exploration state, the animal promotes locomotion and 
suppresses hunting, generating long-ranging trajectories that enhance spatial 
dispersion. We uncover a dorsal raphe subpopulation with persistent activity that 
robustly encodes the exploitation state. The exploitation-state-encoding neurons, 
together with a multimodal trigger network that is associated with state transitions, 
forma stochastically activated nonlinear dynamical system. The activity of this 
oscillatory network correlates with a global retuning of sensorimotor 
transformations during foraging that leads to marked changes in both the motivation 
to hunt for prey and the accuracy of motor sequences during hunting. This work 
reveals an important hidden variable that shapes the temporal structure of 
motivation and decision-making. 


We placed freely moving zebrafish larvae in a behavioural arena 
(35 mm in diameter) with Paramecia (Fig. 1, Methods), and simultane- 
ously recorded behaviour and whole-brain neural activity at cellular 
resolution for 50 to 80 min. We continuously tracked fish position 
and heading, eye orientation, tail curvature and Paramecia positions 
(Fig. 1a, Extended Data Fig. la-c, Supplementary Video 1, Methods). 
Each motor movementis classified into one of seven observed movement 
types’°” (Extended Data Fig. 1d-i, Supplementary Video 2, Methods). 
We manually annotated the start, end and outcome of each hunting 
attempt. Each hunting sequence begins with eye convergence (Extended 
Data Fig. 1b, c), in which the eyes—which are ordinarily parallel—reorient 
towards the midline to enable prey pursuit”. The animal then pursues the 
prey using high-precision turns (J-turns) and swims (approach swims)”, 
and attempts to ingest the prey with capture swims“, J-turns, high-angle 
turns or routine turns (Extended Data Fig. 2a-f). Every hunting sequence 
ends with one of four possible outcomes—success, miss, abort or reject 
(Extended Data Fig. 2a, d-f, Supplementary Video 1, Methods). 

To investigate the overall dynamics of the behaviour of the animal 
across time, we defined a metric for exploratory drive as the diameter of 


the smallest circle enclosing the animal's trajectory in 5-s bins, which we 
call dispersal distance (Fig. 1a). We fit a hidden Markov model (HMM) to 
dispersal distance to define two behavioural states (Fig. 1b-d, Methods). 
Dispersal distance has a bimodal distribution (Extended Data 
Fig. 3a), with substantially higher dispersal in the exploration state 
(9.6+2.5mm, mean +s.d.,n=36 animals) than inthe exploitation state 
(2.3 +1.3 mm, n= 36 animals). On average, the exploitation state per- 
sists for 7.1+ 3.9 min, and exploration state persists for 5.5 + 6.2 min 
(mean + s.d., n = 36 animals). The duration of each state follows an 
exponential distribution (A =0.17 min“ for exploitation, A=0.22 min™ 
for exploration, pooled distribution of n=36 animals) (Fig. 1d, Extended 
Data Fig. 3g). 

Toindependently confirm our choice of behavioural states, we assem- 
bled acomprehensive set of ten behavioural parameters, suchas the rate 
of each movementtype, and the rate of hunting initiations and outcomes 
(Methods). Thoughsome individual behavioural parameters may becon- 
tinuously distributed across the HMM states (Extended Data Fig. 3b, c), 
principal component analysis (PCA) of all behavioural parameters 
(Fig. le, Extended Data Fig. 3d-f, Methods) confirms the existence of 
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Fig. 1| Freely moving zebrafish larvae display behavioural-state oscillations 
during foraging. a, Zebrafish larvae hunt for Paramecia ina behavioural arena 
(left) while we track the prey (top right, yellow), fish tail curvature (top right, 
red), heading vector (bottom right, magenta) and eye orientation (bottom 
right, cyan). Dispersal distance is defined as the diameter of the smallest circle 
encompassing the trajectory of the fish over 5s. b, Anexample of fish position 
and dispersal distance across one hour, representative of n=36 animals. 


The histogram shows the distribution of the first principal component (PC1) 
colour-coded by whether the observation was classified as exploration (blue) 
or exploitation (red) by the HMM. f-h, State-dependent differences inthe 
conditional probability (Extended Data Fig. 2t) of exploration-related 
movements (f), hunting-related movements (g) and hunting outcomes (h). 
*P<0.05,**P<0.01,***P<0.001.n=25 animals, 2,972 hunting attempts and 
88,814 movements. i,j, Time-aligned averages (mean +s.e.m.) of behavioural 


Shaded regions depict exploitation state (red) and exploration state (blue) metrics after aligning to HMM-inferred transitions from exploitation to 
inferred bya HMM (Methods). A raster of hunting events is shown (bottom, exploration, including dispersal distance (i, blue), hunting rate (i, red), 

black ticks). c, Representative 5-min movement trajectories in the behavioural conditional probability of success per hunting initiation (j, orange) and 

arena during exploitation state (top) and exploration state (bottom). d, State conditional probability of miss per hunting initiation (j, teal) (2 =36 animals). 
duration distributions for exploitation (top) and exploration (bottom) (n=36 k, Hunting behaviour as a Markov decision process. Nodes represent decision 
animals), measured in two-minute bins (black dots) anda maximum likelihood points in the hunting sequence. Arrows show state-dependent transition 
exponential model fit (red and blue lines) to the underlying duration data. probability in exploitation state (red) and exploration state (blue). 


e, PCA was performed using ten behavioural variables (Methods). 


two opposing behavioural states. Most of the behavioural variance was The exploration and exploitation states define an axis for global 
explained by the first principal component (Extended Data Fig.3e), | changes in behaviour (Fig. 1f-k, Extended Data Figs. 2, 3), including 
which has a bimodal distribution that matches the hiddenMarkovstates _huntinginitiation rate (Fig. 1g, i, k), the conditional probability of hunt- 
inferred from dispersal distance (Fig. le, Extended Data Fig. 3d). ing success per hunting initiation (Fig. 1h, j), the Markov transition 
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Fig. 3 | Neuromodulation of sensorimotor activity by dorsal raphe state 
neurons. a-d, Normalized dorsal raphe state activity is correlated with 
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e, f, Spatial density maps of sensorimotor neurons (e) and eye-convergence 
neurons (f) with positive (left, red) or negative (right, blue) correlation to 
normalized dorsal raphe state activity across all hunting events. Colour maps 
indicate total number of neurons after projection across 17 animals. Colour 
bars ine, findicate number of neuronal centroids. V, ventral; D, dorsal. 

g-k, Left, event-triggered activity (average from Osto+2s from hunting 
initiation) as a function of normalized dorsal raphe state activity. Slope of the 
regression fit is shown on the plot. **P< 0.01, ***P< 0.001; two-sample t-test. 


probabilities throughout the hunting sequence (Fig. 1k), and the con- 
ditional probability of each movement type (Fig. 1f, g, Extended Data 
Figs. 2t, 3c). 

During exploration, an increase in overall movement rate and the 
probability of high-speed turns (routine turns) and swims (type 2 slow 
swims) (Fig. 1f, Extended Data Fig. 3c) combine to promote spatial dis- 
persal. By contrast, during exploitation, the probability of hunting- 
related movements such as eye convergence, J-turns, approach swims 
and capture swims significantly increases (Fig. 1g, Extended Data Figs. 2t, 
3c). The increased probability of hunting sequences that culminate in 
a capture swim in the exploitation state (Extended Data Fig. 2s) con- 
tributes to increased hunting success (Extended Data Fig. 2g, j, m, p). 
There is a strong inverse relationship between movement rate and the 
probability of hunting-related movements (Extended Data Figs. 2u, 3c). 

We found that behavioural-state transitions are not solely dependent 
onchanges in satiety. Larvae placed ina behavioural arena without prey, 
which prevents fluctuations in satiety owing to prey consumption, con- 
tinue to exhibit clear alternations in global dispersal and local search 
(Extended Data Fig. 4a-d). This suggests an intrinsic mechanism that 
can drive spontaneous oscillations in motivation and decision-making 
during foraging. Next, we compared larvae that were fed until the start 
of the experiment with siblings that were starved for 24 h (Extended 
Data Fig. 4e, f). Starved animals spent more time in the exploitation 
state for the first 10 min of the experiment, but this effect was not statis- 
tically significant (Extended Data Fig. 4e). Thus, although it is possible 
that hunger may weakly modulate exploitation- and exploration-state 
transitions, more comprehensive studies are required to fully address 
this relationship. 


Brainwide encoding of internal state 


Weimaged whole-brain calcium activity at cellular resolution in freely 
swimming zebrafish larvae expressing pan-neuronal H2B-GCaMPé6s 
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Right, box plot of the same event-triggered activity binned into low (< 0.3, L), 
middle (0.3 <x< 0.7, M) and high (> 0.7, H) normalized dorsal raphe state 
activity. Box plots throughout depict the median (horizontal line), first and 
third quartiles (box) and the data range within 1.5-fold of the interquartile 
range of the first and third quartiles, respectively. Dorsal raphe state activity 
has a positive relationship with activity of neurons inthe cerebellum (g), eye 
convergence neurons in two lateral hindbrain clusters (i) and asubpopulation 
of eye convergence neurons intwo medial-hindbrain clusters (j). Dorsal raphe 
state activity has anegative relationship with the activity of neurons inthe 
Gad1b cluster 13 region (h) anda subpopulation of eye convergence neuronsin 
two medial hindbrain clusters (k). Dorsal raphe state activity was normalized 
from Otol for each animal to facilitate comparisons of neuromodulation 
across animals (Methods). 


during foraging, and extracted the fluorescence across time F(¢) for 
each neuronal centroid in the brain using non-negative matrix fac- 
torization (NMF)’* (Methods). Using linear regression (Methods), we 
identified neural populations showing correlation to prey detection 
(Extended Data Fig. 5f, i), eye convergence (Extended Data Fig. 5g), 
prey ingestion (Extended Data Fig. 5h), each movement type (Extended 
Data Fig. 5j) and internal state (421+ 141 exploitation-state neurons and 
910 +319 exploration-state neurons per animal, mean +s.d.) (Fig. 2a-e). 

For eachneural population, we applied a spatial P value test to iden- 
tify functional clusters that are spatially consistent across animals 
(Extended Data Fig. 5a, Methods). In particular, the dorsal raphe had 
the highest spatial enrichment of exploitation-state-related neurons 
among all brain regions (Fig. 2a, b, Extended Data Fig. 5b-e, Supple- 
mentary Video 3). Approximately 19 + 8% (mean +s.d.) of dorsal raphe 
neurons encode the exploitation state. 

By PCA analysis of whole-brain activity”, exploitation-state neurons 
define the major axis of global brain activity (Fig. 2d, Extended Data 
Fig. 6). The peak activity of exploitation-state neurons is predictive 
of exploitation-state duration (Extended Data Fig. 7a). The activity of 
these neurons is notably persistent and asymmetric, peaking rapidly 
and decaying slowly (Fig. 2e). Transition from exploitation to explora- 
tion occurs when the activity of these neurons relaxes back to baseline 
(Fig. 2c, e). 

Because some neurons with apparent correlations to exploitation 
state might be attributable to transient, event-locked activity related to 
prey pursuit or ingestion, we excluded increasingly large time intervals 
encompassing all hunting events across the entire experiment (Meth- 
ods). We then repeated the linear regression analysis to confirm that 
exploitation-state neuron activity is persistently and stably correlated 
with behavioural state and not merely due to transient activity around 
individual hunting events (Extended Data Fig. 8a, b). 

We took two further steps to validate the dorsal raphe exploitation- 
state neurons. First, we were able to independently discover these 
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neurons using unsupervised k-means clustering (k=5) onthe pairwise 
activity correlation matrix between all dorsal raphe neurons (Extended 
Data Fig. 8c, Methods). In every fish (n =17), we found that the cluster 
with the highest average intra-cluster correlation encodes the exploi- 
tation state (Extended Data Fig. 8d). Second, we found that the mean 
correlation between behavioural state and the activity of dorsal raphe 
state neurons in the same animal was 0.468, whereas the correlation 
between the behavioural state of one animal and the exploitation-state 
activity of another was 0.024 (Extended Data Fig. 8f). This difference 
is highly significant (P< 10°, two-sample t-test), demonstrating that 
the discovery of dorsal raphe state neurons is not a result of spurious 
correlations’. 

Exploration-associated neurons differ in several important ways. 
Whereas exploration-associated neurons show consistently elevated 
activity during exploration, they do not show asymmetric rise and 
decay kinetics (Fig. 2c, e). Notably, the peak activity of these neurons 
is not predictive of the duration of the exploration state (Extended 
Data Fig. 7a), which suggests that they may be primarily involved in 
the expression of exploration behaviour, but not in determining the 
duration of the exploration state. 

To determine the neuromodulators that participate in exploitation 
and exploration, we examined the activity of serotonergic, dopamin- 
ergic and acetylcholinergic neurons throughout the entire brain using 
the MultiMAP method”°”' (Fig. 2f, Extended Data Fig. 9, Supplementary 
Video 4, Methods). Each neuromodulatory population has distinct 
roles in internal state and hunting, is heterogeneously composed of 
multiple functional types and exhibits distinct temporal dynamics 
(Fig. 2f, Extended Data Fig. 10). 

Serotonergic neurons in the dorsal raphe have persistent activ- 
ity that can last for minutes and correlates with exploitation state 
(Fig. 2f, Extended Data Fig. 10a, d). Serotonergic neurons in the 
dorsal hindbrain (Fig. 2f, Extended Data Fig. 10a, d), dopaminergic 
neurons in the locus coeruleus and medulla oblongata” (Fig. 2f, 
Extended Data Fig. 10b, e), as well as acetylcholinergic neurons inthe 
cerebellum, are preferentially activated by hunting success (Fig. 2f, 
Extended Data Fig. 10c, f). Tectal acetylcholinergic neurons are acti- 
vated when prey enter their receptive fields (Fig. 2f, Extended Data 
Fig. 10f), and the medial rhombencephalic acetylcholinergic popu- 
lation correlates strongly with eye convergence (Fig. 2f, Extended 
Data Fig. 10c). 


State-dependent sensorimotor tuning 

We find that specific sensorimotor clusters are modulated by inter- 
nal state (Fig. 3, Methods). Dorsal raphe state activity correlates with 
significant changes in locomotion, hunting initiation and accuracy 
(Fig. 3a—d). Consistent with this observation, dorsal raphe state activ- 
ity is positively correlated with the activity of the cerebellum Vglut2 
cluster” (Fig. 3e, g)—which encodes eye convergence and prey ingestion 
(Extended Data Fig. 5g, h)—and negatively correlated with the activity 
of Gadib cluster 13 in the hindbrain” (Fig. 3e, h), which encodes routine 
turns (Extended Data Fig. 5j). 

Moreover, even within a functional network that encodes the 
same behavioural event (such as eye convergence or prey ingestion) 
(Extended Data Fig. 5g, h), the degree of correlation with dorsal raphe 
activity varies across different anatomical modules (Fig. 3f, i-k). In 
rhombomere 6, two lateral clusters contain more than 14.1-fold more 
neurons that are positively modulated by state (Fig. 3f, i), whereas 
two medial clusters containa sizable number of neurons that are both 
positively and negatively modulated by state (366 and 145 neurons, 
respectively) (Fig. 3f, j-k). 

Our results suggest that state-dependent neuromodulation targets 
specific sensorimotor modules inthe brain. The diversity of neuromod- 
ulatory targets and effects suggests a global retuning of sensorimotor 
transformations as a function of internal state. 
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Fig. 4| Anetwork of trigger neurons is active at state transition from 
exploration to exploitation. a, Spatial map of trigger neurons that are active 
at the transition from exploration to exploitation state (n=15 animals, all 
neurons shown have a spatial P value < 0.025). Colour bar indicates number of 
neuronal centroids. b, Example fluorescent activity (mean +s.d.) of dorsal 
raphe exploitation-state neurons (red), exploration-state neurons (blue), and 
trigger neurons (yellow) across multiple state oscillations (vertical black lines). 
c, Mean activity of dorsal raphe exploitation-state neurons (red) and trigger 
neurons (yellow) aligned to state transition from exploration to exploitation. 
d, Peak activity of trigger neurons scales linearly with the peak activity of 
dorsal raphe exploitation-state neurons (left). Peak activity of dorsal raphe 
exploitation-state neurons correlates with the duration of exploitation state 
(right). e, Phase space plot of exploitation- and exploration-state activity (top). 
Phase space plot of dorsal raphe state activity and trigger activity (bottom). 
Both plots depict one cycle based on average state-aligned activity. Blue-to-red 
colour map depicts the transition from exploration to exploitation state. The 
return trajectory from exploitation to exploration state is shownin grey. 

f, Zebrafish larvae were placed in the presence of Paramecia and imaged for 

50 minin light, 10 min in darkness (all light sources including fluorescence 
excitation source were turned off), followed by 20 min in light. The 10 min of 
darkness followed by light strongly induces the exploitation state, as shown by 
the persistent suppression of dispersal distance (black) and activation of raphe 
state neurons (red) and trigger neurons (yellow). g, Successful ingestion of 
prey weakly promotes the exploitation state, as shown by suppression of 
dispersal distance, weak activation of trigger neurons and weak activation of 
dorsal raphe state neurons after successful capture attempts (black, yellow, 
and red, respectively) compared with missed capture attempts (grey). Datain 
b,c, f,garemean+s.e.m. 


Neural dynamics at state transitions 

We find, by linear regression, a distributed network of neurons that 
are triggered specifically at the state transition from exploration to 
exploitation (Fig. 4, Methods). Spatially significant clusters are pri- 
marily located in the ventrolateral habenula, dorsal raphe and Gad1b 
clusters inthe rhombencephalon (Fig. 4a—c, Extended Data Fig. 5b-e, 
Supplementary Video 3). The enrichment of the trigger neurons in 
the habenula, which makes both direct and indirect synaptic connec- 
tions to the dorsal raphe**”, suggests a possible circuit mechanism 
for internal state transitions. 

Neurons in this trigger network are weakly activated before state 
transition and peak at the time of the state transition (Fig. 4c). 
The peak activity amplitude of this trigger network is linearly cor- 
related with the peak activation of dorsal raphe state neurons, 
which is in turn correlated with the duration of the exploitation 
state (Fig. 4d). 


The short impulsive dynamics of the trigger neurons and the persis- 
tent asymmetric dynamics of dorsal raphe state neurons (Fig. 4c) can 
be combined to form a two-component nonlinear dynamical system 
that produces a nonsinusoidal repetitive output (Fig. 4e, Extended 
Data Fig. 7). A stochastic dynamical system model can recapitulate 
the distribution of behavioural state durations and the dynamics of 
trigger and state neuron activity (Extended Data Fig. 7e, f, Methods). 

The trigger network can be either spontaneously active or driven 
by sensory stimuli (Fig. 4f, g, Extended Data Fig. 7b-d). For instance, 
10 minutes of darkness followed by light strongly suppresses dispersal 
distance for minutes, triggers transition into the exploitation state 
in all animals (9 out of 9), activates the trigger network and induces a 
prolonged activation of dorsal raphe state neurons (Fig. 4f). In addi- 
tion, following successful prey capture, we also observe a momentary 
decrease in spatial dispersal, a weak activation of the trigger network 
and a brief increase in dorsal raphe state activity (Fig. 4g). This could 
potentially act as amechanism to prolong the exploitation state in an 
advantageous environment. 

Our results demonstrate that zebrafish larvae continuously oscil- 
late between opposing global brain states during foraging, and that a 
dorsal raphe subpopulation may coordinate a fundamental trade-off 
between overall motor activity and motor bias towards hunting. This 
subpopulation encodes a generalized motivational state—not simply 
hunger or satiety””—and may provide an evolutionary link in the neural 
mechanism of internal state oscillations from C. elegans’ to rodents® 
and primates””®. 
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Methods 


No statistical methods were used to predetermine sample size. The 
experiments were not randomized. The investigators were not blinded 
to allocation during experiments and outcome assessment. 


Simultaneous behavioural and neural imaging by tracking 
microscopy 

The behaviour arena is 35 mm in diameter and 810 pm in height, con- 
structed according to the same design as a previous study’. In brief, 
the top and bottom of the chamber are made of glass to allow optical 
access from above and below. The chamber walls are cut from precast 
gas-permeable polydimethylsiloxane (PDMS; HT-6240, Rogers) using 
a computer-controlled blade (Curio, Silhouette). 

We imaged whole-brain calcium activity at cellular resolution in freely 
swimming larval zebrafish expressing pan-neuronal H2B-GCaMPé6s°, 
a nuclear-localized reporter of neural activity. Tracking microscopy 
was performed as described ina previous study’. In brief, to stabilize 
the brain position of the fish during free behaviour, we use a motion 
cancellation system that consists of a NIR imaging setup for animal 
tracking, abehaviour arena, and X-Y motorized stages. The NIRimaging 
setup consists of aCMOS camera (BFS-U3-32S4M-C, FLIR), cameralens 
(13VM1040ASIR, Tamron), and optical filters (FESO900 and FELO700, 
Thorlabs). The NIR camera collects images of the entire animal at 
250 Hz with a FOV of 10 x 10 mm and a resolution of 608 x 608 pixels 
(16 um per pixel). The behaviour arena is illuminated for dark field 
imaging by a custom ring light consisting of 60 narrow-angle 850 nm 
infrared LEDs (SFH4655-Z, Osram). Each image is precisely timed by 
driving the LEDs with a short pulse (10-100 ys) that is synchronized 
withthe camera exposure. The pulse driver circuit consists of a MOSFET 
(PSMN4R3-30PL,127, NXP Semiconductors) and 10 current-limiting 
series resistors governing 10 strings of 6 LEDs. A pulse duration of 
10-100 us is used to avoid motion blur owing to rapid fish motion. 
The X-Y motorized stages (ILS200LM, Newport) consist of a stacked 
pair of three-phaseironless linear motors and a motor controller (XPS- 
Q8, Newport). Acustom-machined open frame with high stiffness and 
low mass mechanically couples the behaviour arena to the stages and 
creates space underneath the arena for the NIR optics. 

To record neural activity, we used differential illumination focal 
filtering (DIFF)’ microscopy to image 1001 x 761 x 150 yum of the brain 
at cellular resolution at a volume rate of 2 Hz. Each volume consisted of 
50 DIFF-sectioned images, computed from pairs of structured images 
collected at 200 Hz by an sCMOS camera (C13752-50U, Hamamatsu). 
The design and performance of the DIFF imaging system have been 
characterized in depth ina previous study’. 


Hunting assay 

Six-to-eight-days-post-fertilization (dpf) larval zebrafish were held 
without food for at least 1h before the start of the hunting assay. Imme- 
diately before the experiment, approximately 200-500 Paramecia were 
loaded into the behavioural arena. Then, asingle 6-8-dpf zebrafish was 
placed in the behavioural arena for 1-2 h during the assay. The arena 
was illuminated from above by a custom circular array of 60 white 
LEDs (C512A-WNS-CZOCOI151, Cree)’. Hunting success rate shows no 
significant differences when assayed in stationary behavioural arenas 
orinthe tracking microscope, as demonstrated ina previous study’. For 
the analysis in Fig. 4f and Extended Data Fig. 7b-d, a period of complete 
darkness was created by turning off the white LED array and the DIFF 
fluorescence microscope excitation source for 10 min. 


High-resolution offline registration of fluorescent brain 
volumes for each animal 

For each animal, we registered each fluorescent image from the tracking 
microscope toa high-resolution reference brain volume at micrometre 
resolution collected from the same animal. Anin-depth description and 


characterization of the registration pipeline was published in a previous 
study’. In brief, an initial coarse registration is obtained by optimizing a 
3D rigid transformation that maps the moving image to a plane within 
the reference brain volume. This planar surface is then finely subdivided 
into a deformable surface that is locally adjusted within the reference 
volume using a regularized piecewise affine transform. 


Non-negative matrix factorization 

Post-registration, we extracted fluorescence F(t) across time tfor each 
neuron in the brain using NMF” (n =17 animals, 112,699 + 10,370 neu- 
ronal centroids, mean +s.d., per animal). NMF separates each neuron 
into two components—a spatial footprint and a time-varying activity 
component. We applied constrained NMF" to our whole-brain H2B- 
GCaMPé6s datasets. NMF was performed independently for each axial 
section of a given brain volume. There can be double counting of cen- 
troids ifa neuron spans more than one axial plane. Our axial sections 
are separated by 3 um, and the average diameter of a zebrafish neuron 
is approximately 5 um. 


Registration to acommon reference brain across animals 

We used Advanced Normalization Tools (ANTs, v.2.2.0, running onan 
Ubuntu 16.04 Linux server)??? and ITK-SNAP (v.3.6.0, running on 
a Windows desktop) to spatially map live fluorescent brain volumes 
from different animals into acommon reference space (Z-Brain”’). 
The Z-Brain atlas (Elavl3:H2B-RFP line) served as the fixed image, and 
live fluorescent brain volumes were used as the moving images. We 
first aligned the moving images to the fixed image by using ITK-SNAP 
to match the centre, orientation and scale of the brain. The aligned 
moving and fixed images were then saved in the NII file format. By 
combining linear (translation, rigid, similarity and affine, with 3, 6, 
7 and 12 degrees of freedom, respectively) and nonlinear (diffeomor- 
phic symmetric normalization algorithm, SyN) ANTs registration, we 
optimized the compromise between preservation of cell morphol- 
ogy and global alignment and achieved reasonably precise registra- 
tion (within 1-2 cell diameters) of all live fluorescent brain volumes 
to the common reference. After testing a range of values for each 
of the ANTs registration parameters, we finally chose the following 
parameters: antsRegistration -verbose 1-dimensionality 3-float 
1-output [moving image, moving image _Warped.nii.gz, moving 
_image_InverseWarped.nii.gz]-interpolation WelchWindowedSinc- 
use-histogram-matching 0-winsorize-image-intensities [0.005,0.995]- 
initial-moving-transform [fixed_image.nii, moving_image.nii, 
1]-transform Rigid[0.1]-metric MI [fixed_image.nii, moving_image. 
nii, 1,32,Regular,0.25]-convergence [1000x500x250x0,le-8,10]- 
shrink-factors 12x8x4x2-smoothing-sigmas 4x3x2xlvox-transform 
Affine[0.1]-metric MI [fixed_image.nii, moving image.nii, 1,32,Regu- 
lar,0.25]-convergence [1000x500x250x0,1e-8,10]-shrink-factors 
12x8x4x2-smoothing-sigmas 4x3x2xlvox-transform SyN[0.1,6,0]- 
metric CC [fixed_image.nii, moving _image.nii, 1,2]-convergence 
[100x100x70x50x0,1le-7,10]-shrink-factors 12x8x4x2x1-smoothing- 
sigmas 5x3x2x1xOvox. 


Registration of live fluorescent brain volume to immunostained 
brain volume in the same animal using ANTs 

We applied a four-step procedure to register each live fluorescent 
brain volume to the same brain after fixation and immunostaining” 
(Extended Data Fig. 9). In all four steps, we used the immunostained 
brain as the fixed image and the live fluorescent brain as the moving 
image. The moving images were roughly aligned to the fixed image 
using ITK-SNAP (centre, orientation and scale), and then ANTs regis- 
tration was applied. 

As shown in Extended Data Fig. 9a, step 1 (global registration) was 
designed to obtain a global alignment between the live fluorescent 
brain volume and the immunostained brain volume. This step used the 
same ANTs parameters as in the previous section. The overall outline 


of the registered live fluorescent brain volume was well-matched to 
the immunostained brain volume after step 1. However, because of the 
properties of the SyN nonlinear method, the global registration gener- 
ates distortions in some regions of the live fluorescent brain volume, 
especially in the centre of the brain (for example, raphe). To optimize 
the quality of registration on both the global and local scale, we per- 
formed local registration using ANTs (steps 2-4). In step 2 (tiling), the 
fixed and moving images were subdivided into 12 tiles, such that each 
tile had a50% overlap with adjacent tiles. In step 3 (local registration), 
we applied local registration to each of the 12 tiles, using the same 
ANTs parameters as the global registration. In step 4 (merging), all 
locally registered tiles were then merged together by alpha blending 
the overlapping regions. 

In Extended Data Fig. 9b, we show the superimposed live fluorescent 
and immunostained brain volume before registration, after global 
registration and after global and local registration. In Extended Data 
Fig. 9c, we show the immunostained brain volume (green), live fluores- 
cent brain volume (red), and the superimposed volume for the optic 
tectum (white box in Extended Data Fig. 9b) to further illustrate the 
registration performance (for example, note the precise overlap of 
the sparsely distributed neurons in the optic tectum, denoted by the 
white arrows). Post-registration, the correlation between live andimmu- 
nostained brain volumes was 0.7 (Extended Data Fig. 9d), consistent 
with previous studies on high-precision alignment of brain volumes 
using ANTs”. 


Tail tracking 

We used an optimization-based algorithm to trace the curvature of 
the fish tail. Specifically, for each NIR image of the fish, we created 
a high-resolution spline for the tail to enable further classification 
of movements into distinct movement types (see ‘Classification of 
movements into bout types’). A sequential grid search was used to 
fit an optimized tail spline onto the fish tail. This spline consisted of 
37 tail segments, each 80-ym long, parameterized by the local angle 
change at each segment boundary. This high-resolution optimiza- 
tion-based algorithm was specifically developed to achieve robust 
tail tracking in the presence of bright, highly dynamic prey in the 
vicinity of the tail. 


Classification of movements into bout types 

Movement bout detection. Larval zebrafish move in discrete bouts. 
Bout detection was performed by using a density valley clustering 
method to classify each motor movement into one of seven observed 
movement types (Extended Data Fig. 1d-h, Supplementary Video 2), 
as previously described”. 

The presence of noise in the optimization-based tail tracking can 
lead to false positives in the bout detection pipeline that do not cor- 
respond to tail movements. To make the bout detection pipeline more 
robust, we defined a set of diagnostic criteria to detect and exclude 
false positive bouts. Bouts with a minimum tail beat frequency smaller 
than10 Hzcame from failures to separately detect two adjacent bouts 
and were excluded. Bouts with maximum tail beat frequency larger 
than 100 Hz resulted from unstable tail tracking and were excluded as 
well. Bouts with maximum tail tip angle smaller than 5° resulted from 
tracking noise in the tip of the tail and were not included for further 
analysis. Bouts that had high values of maximum angular speed/bout 
maximum angle (>33°ms”) or bout maximum angular speed (>43°ms 1) 
were confirmed to arise from upstream tail tracking errors and were 
excluded. Finally, all bouts were placed in the 73 kinematic parameter 
PCA space. The bouts that had athird principal component value (PC3) 
larger than 3 were outliers in this space. These bouts were confirmed 
to arise from tail tracking noise in stationary fish and were excluded. 


Half-beat detection. Half-beats were detected as previously 
described”. 


Clustering and bout type classification. For each swim bout, we 
computed 73 kinematic parameters based on the bout and half-beat 
detection, as described previously in depth’. PCA was performed on 
the bout kinematic parameters on an initial set of 53 fish that were col- 
lected under hunting conditions with live prey (hunting), or under no- 
prey conditions (spontaneous swimming) in white light illumination. 
We embedded all the data into the PCA space and included for further 
analysis the first six principal components (PCs). We then separated 
the bout data by experimental condition (hunting or spontaneous 
swimming), the condition in which the fish were raised (raised with 
Paramecia or without), the behavioural state (exploration or exploita- 
tion), eye convergence (converged or unconverged), and in the case of 
the final hunting bout of each hunting sequence, the outcome of hunt- 
ing (miss, abort, rejection or success). The datasets with a large number 
of points were still further subdivided into sets of 3,000 points. To find 
the number of clusters in each dataset, we used clusterdv™. To exclude 
outliers, the number of clusters was selected by drawing arectangle on 
the decision plot (separability index versus local density). 

The clusters were used to classify bout types as previously described 
in depth”®. Bout types were classified for 28 foraging animals (1-2 h of 
behavioural recording per animal in the presence of Paramecia), and 
7 nonforaging animals (about 2h of behavioural recording per animal 
in the absence of Paramecia). Among these seven bout types, there 
are two types directly related to hunting events. Approach swims are 
small forward bouts that often occur when the prey is in front of the 
fish’®. J-turns are high precision turns, which are composed of a series 
of unilateral J-bends (> 90°) at the caudal end of the tail (-20% from 
the tip of the tail)”. 


Validation of bout type classification. To validate the bout classifica- 
tion, we calculated—for each bout—12 kinematic parameters based on 
the fish position, heading and tracking of the tail. Displacementis the 
straight-line change in the head position per bout. Forward displace- 
ment corresponds to the forward component of the displacement 
vector. Maximum head yaw is the maximum peak-to-peak change in 
heading after the first bend and counter bend. The maximum bout 
angle is the magnitude of the largest heading change during a bout. 
Maximum angular speed is the maximum rate of change of heading 
during a bout. ‘Tail maximum amplitude 2’ is the maximum absolute 
tail bend amplitude, during a bout, at 0.6 mm from the swim bladder. 
‘Tail maximum amplitude 7’ is the maximum absolute tail bend ampli- 
tude, during a bout, at 2.1mm from the swim bladder. The C1 and C2 
durations are the durations of the first and second half-beats. The C1 
and C2 maximum angular speeds are the maximum angular velocities 
during the first and second half-beats. ‘First and second tail maximum 
amplitudes 7’ are the maximum absolute tail bend amplitudes, during 
the first and second half-beats, at 2.1mm from the swim bladder. 


Paramecia tracking 

Paramecia appear as bright round regions of high intensity in the 
behaviour camera images. Therefore, the simplest strategy for track- 
ing Paramecia is to estimate a static background image, subtract this 
background image from eachimage frame (separating Paramecia from 
background), perform intensity-based blob detection, segment the 
image into appropriately sized regions of interest representing Parame- 
cia, and then extract the coordinates of each paramecium. Our imple- 
mentation followed essentially these steps, but with a modification to 
the front end. During tracking microscopy, the background image for 
the behaviour camera is dynamic, primarily owing to axial motion of 
the imaging objective. Therefore, we replaced the static background 
subtraction step with a deep neural network** ** (DNN). In brief, we 
used DeeperCut” with default parameters to traina model based ona 
pretrained ResNet-50 network to take a raw camera image as input (that 
is, an image with Paramecia and dynamic background) and produce 
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an output image with high pixel intensity at only the Paramecia loca- 
tions (an image with Paramecia and no background). Thus, the DNN 
performs the role of robust background subtraction, and produces a 
cleaner image that can be analysed by conventional intensity-based 
tracking (Fig. 1a, Extended Data Fig. 1a, Supplementary Video 1; see Sup- 
plementary Methods for details). 


Eye position and orientation tracking 

Wealso used the DNN approach for tracking the position, heading and 
eye orientation of the fish (Fig. 1a, Extended Data Fig. 1a, Supplemen- 
tary Video 1). Six fish body parts were tracked by the DNN, consisting 
of front left eye, rear left eye, front right eye, rear right eye, centre of 
the brain and yolk. To generate the training dataset, 3,767 frames were 
randomly extracted from 11 recording sessions. The architecture of the 
DNN reduces the image resolution, such that a pixel in the score map 
corresponds to 8 x 8 pixels in the input frame. Thus, to obtain score 
maps at the same image resolution as the input frames, we first selected 
a100 x100 ROlin each frame, centred onthe centre of the brain to cover 
all six of the tracked fish body parts. The ROIs were then up-sampled 
eightfold (800 x 800) to ensure the score maps had the same image 
resolution as the raw ROIs. The coordinates of the six fish body parts 
were manually labelled in each up-sampled ROI of the training data. 
The up-sampled ROIs of all training frames and the corresponding 
coordinates of the six fish body parts were then used to trainthe DNN. 
We used DeeperCut” with default parameters to train a model based 
onResNet-50 for 1,030,000 iterations ona NVIDIA GeForce GTX 1080 
Ti GPU for -24h. 

We applied the well-trained DNN to all the recording sessions to 
calculate six score maps for each frame, corresponding to the six body 
parts. The resulting images were then processed using intensity-based 
tracking of each body part using a custom Julia implementation. To 
minimize the effects of tracking error, we summed all six score maps 
together to generate a global pattern of all the body parts for each 
frame. We then calculated the orientation of the fish and the eye 
convergence index for each frame. The orientations of the fish were 
defined as the orientations of the rays from the yolk to the centre of 
the brain of the fish. To calculate the eye convergence index, we first 
calculate the orientations of the eyes, defined as the angle of the ray 
from the rear eye to front eye relative to the heading vector of the fish. 
The eye convergence index was then defined as the difference of the 
individual eye angles. For all fish we recorded, there are two eye con- 
vergence states—converged and unconverged”. We use a threshold 
to define a putative start and end of each eye convergence event. To 
exclude false positives and accurately identify bouts in which the eyes 
were converged, we then visually inspected each eye convergence and 
manually set its beginning and end. 


Annotation of hunting outcomes 

We manually annotated the start, end and outcome of each hunting 
attempt. Each hunting sequence starts with eye convergence and ends 
when the eyes deconverge (Extended Data Fig. 1b, c). For each hunting 
sequence, we manually scored its outcome as success, miss, abort and 
reject. Success is defined as a hunting sequence that ends in the swal- 
lowing of the prey without regurgitation. Reject is defined as a hunting 
sequence that ends in prey ingestion followed by regurgitation. Misses 
and aborts are both hunting sequences that terminate with the fish 
deconverging the eyes without ingestion of prey. Misses were defined as 
the subset of cases in which the last bout of the sequence was executed 
towards the prey. Aborts were defined as the subset of cases where the 
last bout of the sequence was executed away from the prey. The hunting 
outcome is correlated with the last movement of a hunting sequence 
(Extended Data Fig. 2d-f). Successes are preferentially associated with 
capture swims, and misses and aborts are preferentially associated with 
high-angle turns and routine turns. Hunting events were annotated for 
28 foraging animals (1-2 h of behavioural recording per animal in the 


presence of Paramecia), and7 nonforaging animals (~2h of behavioural 
recording per animal in the absence of Paramecia). 

We also manually annotated suction events, which are not associated 
witha discrete movement bout. Suction events were observed inasmall 
number of animals (5 out of 25) (Extended Data Fig. 2t). 


HMM 

We fit a Gaussian mixture model to the dispersal distance distribution 
and determined the number of component distributions in the behav- 
iour by the Bayesian information criterion. The resulting label assign- 
ments were used as input to infer the underlying states of aHMMusing 
the Baum Welch algorithm’, implemented in Julia. Exploitation states 
have a significantly lower movement rate than exploration states. In 
Extended Data Fig. 2u, we show further subdivision of the exploitation 
state into a state with very low movement rate (< 0.1Hz) and very low 
spatial dispersal and a state with moderate movement rate and spatial 
dispersal by fitting a three-state HMM toa subset of animals (n=16). We 
find that the inverse relationship between movement rate and hunting 
probability is consistent regardless of whether we fit 2 or 3 states. The 
lower the movement rate, the more strongly individual movements are 
biased towards hunting. We further applied PCA to a comprehensive 
set of ten behavioural parameters (Extended Data Fig. 3d-f). The ten 
behavioural parameters included the rate of each movement type, as 
well as the rate of hunting initiations and outcomes. Because the HMM 
states were based on dispersal distance, we deliberately excluded dis- 
persal distance from this set of ten behavioural parameters, to enable 
an independent check on the HMM states. 


Multivariate linear regression and cross-correlation analysis 
To uncover the neurons with activity related to behavioural states 
and individual behavioural events, we used a combination of linear 
regression and cross-correlation. We used cross-correlation analysis to 
determine the timing of neural signals with respect to individual events 
(for example, prey ingestion, routine turns and so on). For each class 
of behavioural event, a neuron must have its event-aligned average 
activation peak within 2s of the event to be included in the functional 
network for that event class. We then use linear regression to estimate 
the strength of the relationship between the neural activity and each 
class of behavioural event. For each class of behavioural event, neurons 
must havea significant correlation to events of that class (P< 0.001) to 
be included in the functional network for the event class. To uncover 
the visual network for prey detection, the visual field was subdivided 
into eight angular segments (-120° to 120°). Distance-weighted Para- 
mecia position within each segment was used to generate one of eight 
regressors for prey angles (for example, the retinotopic map of prey 
position in the brain in Extended Data Fig. 5f, i). Paramecia distance 
fromthe head of the fish was exponentially weighted (k=-0.973 mm). 

To verify the hypothesis that the exploitation-state-encoding neu- 
rons encoded the exploitation state and not simply hunting events, 
we excluded increasingly large time intervals (0.5s to 10s witha step 
of 0.5 s) encompassing all hunting events across the entire experi- 
ment (Extended Data Fig. 8a). We then recalculated the correlations 
between the activity of these neurons and the exploitation state after 
the temporal exclusion. As shown in Extended Data Fig. 8b, the dorsal 
raphe exploitation-state neuron population showed higher correlation 
than the set of all exploitation-state neurons, and the correlation coef- 
ficients of both groups were highly stable across all temporal exclusion 
windows from 0.5sto10s. This indicates that the correlation between 
behavioural exploitation state and the activity of exploitation-state 
neurons is not simply an artefact of increased hunting-related activity 
during the exploitation state but can be also decoded during inter- 
movement intervals. 

The consistently higher rate of locomotion during exploration is not 
sufficiently resolved by the calcium reporter to permit the removal of time 
intervals containing individual movement events during exploration. 


PCA of whole-brain neural activity 

PCA of whole-brain neural activity provided an unsupervised perspec- 
tive for demonstrating the existence of exploration and exploitation 
states in the brain activity. We used the NMF-factored neural activity 
as the input to PCA. To eliminate animal handling effects at the begin- 
ning of the experiment, the first ten minutes of neural activity were 
excluded from PCA for each recording session. We applied PCA to the 
NMF-factored neural activity (without preselection, as shownin the first 
column of Extended Data Fig. 6) and projected the high-dimensional 
neural population activity into a three-dimensional PCA space for visu- 
alization (as shown in Fig. 2d, Extended Data Fig. 4b and the second 
column of Extended Data Fig. 6). As shown in the fourth column of 
Extended Data Fig. 6, all PCA trajectories were colour-coded by the 
normalized amplitude (ranging from 0 to 1) of the mean activity of 
exploitation-state neurons (third column of Extended Data Fig. 6). 


k-means clustering to independently uncover exploitation- 
state-encoding subpopulations in the dorsal raphe 

In addition to identifying exploitation-state-encoding dorsal raphe 
neurons by using linear regression to behaviour states, we conducted 
an independent clustering-based procedure to verify the existence of 
exploitation-state-encoding neurons in the dorsal raphe (Extended 
Data Fig. 8c-f). We correlated neural activity between all pairs of dorsal 
raphe neurons to generate a correlation matrix for each fish. An unsu- 
pervised k-means clustering analysis was then applied to the pairwise 
correlation matrix to cluster dorsal raphe neurons into five clusters. 
We selected the tightest-correlated cluster (with highest average intra- 
cluster correlation) from each animal and examined its neural activity 
during the transition of exploration states to exploitation states. In 
every fish (17 out of 17), the tightest-correlated dorsal raphe cluster 
showed significantly higher activity in exploitation state than explo- 
ration state. 


Comparison of correlation within and across fish 


After selecting the tightest-correlated cluster in each animal, we cal- 
culated the correlation ch between the neural activity of neuron kin 
the dorsal raphe subpopulation of fish i, and the behavioural state of 
fishj. We then calculated the mean correlation coefficient C; between 
fish iand fishj by averaging over each dorsal raphe neuron k in the 
selected cluster. As shown in Extended Data Fig. 8e, the neural activity 
of the dorsal raphe subpopulation is reliably correlated with the behav- 
iour state of the same fish but not with the behavioural state of other 
fish, which confirms that these are real correlations and not spurious. 
For quantitative comparison, we compared ch under two conditions: 
i=j(on-diagonal blocks of Extended Data Fig. 8e) andi #/ (off-diago- 
nal blocks of Extended Data Fig. 8e). We observed that the mean of Ci 
for i=jwas 0.468, and the mean of Ci for i#jwas 0.024. This difference 
was highly significant (Extended Data Fig. 8f, P< 10-8, two-sample 
t-test). 

In brain regions that do not encode exploitation state such as the 
inferior olive (Extended Data Fig. 8g-j), the cluster with the highest 
average intracluster correlation is not persistently active during the 
exploitation state (Extended Data Fig. 8g, h). Furthermore, repeating 
the correlation analysis within and across animals, the difference is not 
significant (Extended Data Fig. 8i, j). The mean correlation between 
behavioural state and dorsal raphe state neuron activity in the same 
animal was —0.047, whereas the correlation between the behavioural 
state of one animal and the exploitation state activity of another was 
0.023 (Extended Data Fig. 8j, P=1, two-sample t-test). 


Normalization of dorsal raphe state activity 

To facilitate pooling of data across animals in Fig. 3, dorsal raphe state 
activity was normalized for each animal to a range of 0 to 1 according 
tothe formula 


X~Xmin 


Xy =>, — 
- Xmax — Xmin 


In Fig. 3a—d, Xin aNd X,,4, are defined as the mean of the bottom 
and top tenth percentile, respectively, of dorsal raphe state activ- 
ity across the entire experiment. Normalization was applied to 17 
animals. One of the 17 animals was excluded for analysis owing to 
the low number of total successful hunting attempts. In Fig. 3e-k, 
we compute the correlation between normalized dorsal raphe state 
activity and activity of sensorimotor neurons throughout the brain 
at the time of hunting initiation. For each animal, we generate a 
vector of dorsal raphe state activity at the time of hunting initia- 
tion. This vector is then normalized to range from 0 to 1 by defining 
X min ANd Xnax to be the minimum and maximum values in the vector, 
respectively. 


Spatial P-value analysis 

We developed a spatial P-value analysis method to evaluate the sig- 
nificance of spatial colocalization of functionally classified neurons 
across animals. In brief, we registered all 17 functionally imaged brains 
toa common reference zebrafish brain from the Z-Brain atlas”, com- 
puted the distance of each neuron to its nearest neighbour (sharing 
the same state association) across all coregistered brains and then 
compared to a null model defined by random sampling. As shown in 
Extended Data Fig. 5a, the spatial P-value analysis followed a two-step 
procedure. In step 1, we applied ANTs registration to transform the live 
fluorescent brain volumes of 17 animals into acommon reference space 
defined by Z-Brain. In step 2, we calculated the spatial P value for each 
functionally classified neuron in each animal by examining its spatial 
colocalization across all 17 fish (Extended Data Fig. 5a). For instance, 
to calculate the spatial P value of a functionally classified neuron in 
fish i (shown as red star), we calculate the shortest distance drto any 
neuron in fish / that was classified into the same functional class 
to generate a same-class distance vector [df, ;, ..., df, j-1, Uf j, «.- Uf, ml, 
in which m= 17. We then calculated the shortest distance dn between 
the neuron in fishiand a random size-matched population of neurons 
in fish obtained by randomly sampling all neurons in fish/, with 
the number of neurons sampled equal to the number of neurons in 
the functional class in fishj. Therefore, a random distance vector 
[dn; 1, ..., dN; j-1, AN; ;, ..., dN; m] was calculated. Before performing a 
scalar statistical test to obtain the spatial Pvalue, we adopted a projec- 
tion method to handle the possibility of 1 or 2 outlier fish. The projec- 
tion function used in our analysis was the mean of the lowest 90% of 
the distances within each distance vector, resulting in scalar distances 
dr and dn. We then repeated the random sampling process N times 
(N=10,000 in our analysis) to construct a null distribution of the short- 
est distances between a functionally classified neuron in fish iand the 
randomly sampled neurons in other fish. We then fit this null distribu- 
tion with a normal distribution. The P value for this functionally clas- 
sified neuron in fish i relative to all other fish was then calculated by 
comparing dr to the normally distributed null model obtained from 
sampling dn. We only reject the null hypothesis if a functionally clas- 
sified neuron in fish ihas a spatial P value < 0.025—that is, if the prob- 
ability that the null model generating a distance less than or equal to 
the projected distance is less than 0.025. 


Neuromodulation of sensorimotor activity by dorsal raphe 

state neurons 

We analysed the activity of sensory and premotor neurons throughout 
the brain asa function of dorsal raphe state activity by evaluating the 
significance of their correlation. To test the statistical significance of 
the correlation value for each neuron, we created a null distribution of 
correlation values by randomly permuting the activity of each neuron 
10,000 times. For each neuron, we compared the null distribution of 
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its correlation values against its original correlation value (neurons 
with P value < 0.05 are shown in Fig. 3e, f). 


Astochastic nonlinear oscillator model of internal state 
We developed a stochastic nonlinear dynamical system model of explo- 
ration and exploitation (Extended Data Fig. 7e, f): 


ar = 2 ASPOM 5p — pSPON) 4 Agvoked 5(p_ evoked) _ yr 
@ = aT-B-1s,, 
in which 
) 15 ae 


is an indicator function for exploitation state. 

When the state variable is low (S = 0), the system isin exploration state. 
Spontaneous trigger events arrive according to a Poisson process with 
rate Ay. The duration D, of the exploration state is exponentially distrib- 
uted, Do ~ exp(A), with an average waiting time of 1/A,. Each spontane- 
ous trigger event is represented as a delta function 6(¢- t}°"), 
representing a spontaneous eventat time ¢7?°",, multiplied by a stochas- 
tic amplitude 4;?°". We model both the amplitude 4;?°" and the wait- 
ing time between spontaneous trigger events 7;P°" = £70" — t7P?"™" as 
exponentially distributed random variables, with 4;?°™ ~ exp(A,)and 
T;P°"".. exp(A). With each spontaneous event, the trigger signal rises 
impulsively to 47°", and then decays exponentially at rate y. We can 
configure the model parameters such that the state variable S(t) is rap- 
idly driven to k A;°™ (with k determined primarily by wand yp), and then 
undergoes slow linear decay at rate B. While the state variable is elevated 
(S >), the system is in exploitation state. The duration D}°™ of the 
exploitation state is set by k A??°"'/B, so duration D;”°™ follows an expo- 
nential distribution, D;?°" ~ exp(A,8/k) . When fitting the model to 
empirical data, we fit the event amplitudes 4;?°" and times ¢?°" indi- 
vidually for each transition. The remaining model parameters 
(a, Band y) were obtained by simultaneously fitting the model equations 
for T(t) and S(t) to the dynamics of the activity of trigger and exploita- 
tion state neurons during 14 exploitation states having a duration of at 
least 10 min (14 out of 17 fish had state durations of =10 min). The joint 
fitting procedure yielded @=1.5, B=0.1min“and y=1.3 min’. 


Immunostaining 

We first performed whole-brain calcium imaging during foraging 
for 80 min, and then immediately fixed and immunostained the 
brain for neuromodulatory markers such as serotonin®* °°, dopa- 
mine”** or acetylcholine” using the MultiMAP method” to relate 
antibody-positive neurons to their neural activity. The immunohis- 
tochemical method to fluorescently label larval zebrafish neural 
populations used for these experiments, as well as at least ten con- 
trol embryos for each antibody, is a slightly modified version of the 
MultiMAP method published previously”°. We found this method 
yielded the same neuronal labelling patterns as both the standard 
PFA immunolabelling methods and published gene expression 
patterns*°. In contrast to the standard PFA immunolabelling tech- 
nique, our modified protocol yielded better antibody penetration 
throughout the entire brain. As previously described”’, we found 
that by embedding the intact fish, the integrity of brain structures 
was preserved, allowing registration of live neurons with immuno- 
labelled neurons. 

In brief, after functional imaging, larvae were embedded in blocks of 
2.5% intermediate melt agarose, immersed inice-cold 4% PFA in phos- 
phate buffered saline with 0.25% Triton-X100 (PBST) and incubated 
onarocker at 4 °C overnight. 


Larvae were rinsed 3 times in ice-cold PBST buffer, then incubated for 
several hours with PBST buffer, changed every 2 hat 4°C. Allantibody 
incubations were performed at 4 °C. Primary antibodies were added at 
a1:500 dilution in PBST and rocked for 72h, followed by several hours 
of washing with PBST buffer, changed every 2h. Secondary antibodies 
were added at a 1:500 dilution in PBST and rocked for another 72 h, 
followed by several hours of washing with PBST. To image, embedded 
fish were mounted in 2% agarose and imaged using a Zeiss 880 confo- 
cal microscope with a water immersion 20x objective. Z stacks were 
collected with pixel dimensions of 0.998 ppm x 0.998 pm x 2 um. The 
image size was 638.9 pum x 638.9 pm. 

The primary antibody used to label dopamine neurons was mouse 
anti-tyrosine hydroxylase (Immunostar 22941). The primary antibody 
used to label serotonin neurons was rabbit anti-SHT (Immunostar 
20080). The primary antibody used to label acetylcholine neurons 
was goat anti-choline acetyltransferase (Millipore AB144P). 

The secondary antibodies used were donkey anti-mouse Alexa647 
(Jackson ImmunoResearch 715-606-151), donkey anti-rabbit Alexa594 
(Jackson ImmunoResearch 711-586-152) and donkey anti-goat Alexa647 
(Jackson ImmunoResearch 705-606-147). 

For display purposes only, Supplementary Video 4 of dopaminergic 
and acetylcholinergic brain stacks were gamma-adjusted witha gamma 
value of 1.7 in Photoshop. 


Satiety control experiment and starved control experiment 
We first examined the relation between exploitation/exploration tran- 
sitions and changes in satiety by placing larvae in a behavioural arena 
without prey, which has no change in satiety from prey consumption. 
As expected, the probability of hunting-related movements is lower 
in the absence of prey (Extended Data Fig. 4d). However, all animals 
continue to show state-dependent modulation of locomotion and 
hunting-related movements in the absence of prey (Extended Data 
Fig. 4d). Animals continue to prefer routine turns and slow swims 2 dur- 
ing exploration and hunting-related movements such asJ-turns during 
exploitation (Extended Data Fig. 4d). Overall the amount of time spent 
inthe exploitation state is slightly elevated in the absence of prey, but 
this difference was not statistically significant (Extended Data Fig. 4c). 
We further tested whether satiety can act as a modulator of exploita- 
tion and exploration state dynamics. We compared larvae that were 
fed until the start of the experiment with siblings that were starved 
for 24 h. We observed that starved fish also showed an increase in the 
probability of hunting-related movement such as J-turns (Extended 
Data Fig. 4f) across the entire experiment, suggesting that hunger can 
also globally elevate hunting. Thus, an intrinsic mechanism exists to 
drive spontaneous oscillations in decision-making during foraging. 


Animal care and transgenic lines 

All animal experiments were approved by the Harvard University Insti- 
tutional Animal Care and Use Committee. All imaging experiments used 
Tg(elavl3:H2B-GCaMP6s*”")® with nacre (mitfa” ). Behavioural experi- 
ments used Tg(elavl3:H2B-GCaMP6s‘"*) and Tg(elavl3:GCaMPé6s*")°, 
both with nacre (mitfa~ ). All zebrafish (Danio rerio) were 6-8 dpf. Fish 
were reared ona 12:12 hlight:dark cycle at 25 °C. They were maintained 
in groups of 20 in E3 water and were fed Paramecia daily. 


Statistical analysis 

Sample sizes and statistics are reported in the figure legends and text 
for each measurement. For comparison of behaviour across exploration 
and exploitation states, we used a two-tailed Wilcoxon signed-rank test. 
Adjusted Pvalues are corrected for multiple comparisons by the Holm- 
Bonferroni method. Box plots depict the median (horizontal line), first 
and third quartiles (box), and the smallest and largest data within 1.5-fold 
of the interquartile range of the first and third quartiles, respectively. 
Sample size was not predetermined by a statistical method. The experi- 
ments were not randomized and were not performed with blinding. 


Computer hardware for data acquisition and analysis 

The tracking microscope was controlled bya rack-mounted eight-core 
(i7-5960X, Intel) workstation with two GPUs (GTX 1080, NVIDIA). Offline 
data analysis was performed using four rack-mounted eight-core Linux 
servers (i7-6900K, Intel) with four GPUs each (GTX 1080 Ti, NVIDIA). 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


The data supporting the findings of this study are available from the 
corresponding authors upon request. 


Code availability 


All software was written in Julia 0.6.3 and C++11 using the Julia package 
ecosystem (Optim.jl, PyPlot.jl, Images.jl, Cairo.jl, HDF5.jl, Lasso.jl, 
CUDAdrv.jl and CUDArt.jl). GPU processing was implemented for fish 
tracking, as well as online and offline image registration. Image regis- 
tration and data analysis code was based on published work®!0162132,3, 
and is available upon request. 
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Extended Data Fig. 1| See next page for caption. 
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Extended Data Fig. 1| DNN pipeline for automated tracking of Paramecia 
and fish and density valley clustering identifies seven bout types performed 
by larvae during foraging. a, Automated tracking of Paramecia and fish 
followed a four-step procedure (Methods). The rightmost panel shows the 
automated annotation of Paramecia position (yellow circles), eye position 
(dark blue, blue, yellow and red dots), yolk position (green dot) and brain 
position (light green dot). Fish heading (red line) and eye convergence index 
(angle in cyan) were calculated using eye, and yolk, and brain positions 
(Methods). b, Schematic of eye convergence index when the fish is not hunting 
(left) or hunting (right). c, Calculation of the eye convergence index. Top, 
angles of the left and right eyes are calculated from the two detected points of 
each eye. Bottom, eye convergence index is calculated as the difference of the 
individual eye angles. By setting a threshold for the eye convergence index, we 
define the start and end of each hunting sequence. d-i, The bout clusters that 
larvae executed during functional imaging with the tracking microscope 


(25 larvae in the presence of Paramecia and 7 in the absence of Paramecia) were 
identified by clusterdv (Methods). To estimate the similarity between bout 
clusters, pairwise ROC curves were calculated. d, Dimensionality reduction 
using ¢t-distributed stochastic neighbour embedding (t-SNE). e, Evidence 
accumulation matrix of clusterdv assignments was constructed by repeating 
the t-SNE embedding 100 times. The rows and columns of the matrix were 
sorted by single link clustering. f, Dendrogram of e. Greenline marks the 
largest jump on the dendrogram and determines the number of bouts types 
(seven) that were detected. g, First 2 PCs of 73 kinematic parameters that were 
calculated for each bout. Colours represent bout categories fromf.h, Angle of 
caudal tail segment versus time of 50 randomly picked bouts of each type. Cyan 
lines are the average of all bouts in each category. i, Bout kinematic 
distributions were obtained by using all the bouts of eachtype. Colours 
represent bout typesas inh. 
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Extended Data Fig. 2|See next page for caption. 
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Extended Data Fig. 2 | Sequential Markov decision process of larval foraging 
behaviour, final hunting bout is modulated by foraging state, and 
movement rate and hunting probability are inversely related. a—r, Markov 
decision process of larval foraging behaviour. A schematic of foraging 
behaviour as a Markov decision process is shown for all time (a), exploitation 
state (g) or exploration state (m). Each node represents a decision point inthe 
hunting sequence, and each arrow has a state-dependent transition probability 
(grey number next to arrow). The ‘free behaviour’ and ‘pursuit’ nodes are 
composed of sequences of multiple bout types that may occur with distinct 
probabilities. b,c,h,i,n, 0, Bout type probabilities for the free behaviour node 
(b,h, n), which occurs when the eyes are unconverged, and the pursuit node 
(c,i, 0), which takes place when the eyes are converged and the fish is chasing 
prey. d-f, j-l, p-r, Bout type probabilities of the last bout of the pursuit 
sequence for each of the hunting outcomes: prey in mouth (d,j, p), abort 

(e, k, q) or miss (f, I, r). Once the prey is in the mouth, the fish can swallow 
(success) or spit it out (rejection). The colours in the box plots represent each 
observed bout type (legend, top right corner). Statistical analysis is presented 
in Supplementary Table1.s, Movement type that the fish executes at the end of 
each hunting sequence during exploitation and exploration (n=25 animals, 
2,972 hunting attempts). t, Box plots of state-dependent differences inthe 
probability of exploration-related movement types (top), probability of 


hunting-related movement types (middle) and probability of hunting 
outcomes (bottom). Probability of amovement type is defined as the fraction 
of all movements that are of a given type, and probability of hunting outcomeis 
defined as the fraction of all hunting sequences that end witha given outcome. 
Adjusted Pvalues for state-dependent differences: routine turn (7.37 x 10°), 
slow swim type 1 (0.092), slow swim type 2 (9.83 x 10°), eye convergence 

(6.14 x 10°), J-turn (8.60 x10), approach swim (9.83 x 10), capture swim 

(7.63 x10), high angle turn (1.0 x 10°), suction (0.25), success (0.020), 
rejection (0.63) and miss (3.5 x 10°). n= 25 animals, 2,972 hunting attempts and 
88,814 movements. u, To more finely parse state-dependent behaviour, we 
further subdivide the dispersal data by fitting a3-state HMM. This resultsina 
state with very low movement rate (< 0.1Hz, yellow) and spatial dispersal, a 
state with intermediate movement rate and spatial dispersal (red) andastate 
with very high movement rate and dispersal (blue). We find that the inverse 
relationship between movement rate and hunting probability is consistent 
regardless of whether we fit two (t) or three states (u). The lower the movement 
rate, the more strongly individual movements are biased towards hunting 
(n=16 animals, 1,605 hunting attempts and 40,069 movements). s—u, Pairwise 
comparisons between the states was performed by using a two-tailed Wilcoxon 
signed-rank test. Adjusted P values have been corrected for multiple 
comparisons by Holm-Bonferroni method. *P<0.05, **P< 0.0land***P< 0.001. 
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Extended Data Fig. 3 | Distribution of behavioural parameters across 
exploitation and exploration state. a, Distribution of dispersal distance (left), 
and colour-coded (right) by whether the observation occurred during 
exploration (blue) or exploitation (red) state, as classified by the HMM 
(Methods). To facilitate comparison across animals, the dispersal distance was 
normalized to the range of each animal. b, Distribution of hunting probability 
across all time (left), during exploration state (blue) and during exploitation 
state (red). c, Distribution of probability for hunting, approach swim, J-turn, 
capture swim, routine turn, slow swim typeI, slow swim type Il and high-angle 
turn, plotted against dispersal distance, and colour-coded by whether the 
observation occurred during exploitation (red) or exploration (blue) state. 
d-f, PCA of foraging behaviour reveals a bimodal distribution of behavioural 
variables. PCA was performed using ten behavioural variables: rate of each 
movement type (approach swim, J-turn, capture swim, routine turn, slow swim 


typel, slowswim type Il and high-angle turn), rate of all hunting sequences, rate 
of successful hunting sequences and rate of aborted hunting sequences 
(Methods). d, Distribution of the first PC (left), and colour-coded (right) by 
whether the observation was classified as exploration (blue) or exploitation 
(red) by the HMM. e, Variance explained by each PC. f, Distribution of all 
behavioural parameters projected into the first three PC dimensions, colour- 
coded by whether the observation occurred during exploitation (red) or 
exploration (blue) state. For a-f, n=19 animals. g, Normalized state duration 
distributions for exploitation (top) and exploration (bottom) (n= 36 animals), 
divided into 20 bins. The state durations of each animal were normalized by the 
mean state duration of that animal (normalization was separately performed 
for exploitation state and exploration state), and then used to fit an 
exponential model by maximum likelihood (red and blue lines). 
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Extended Data Fig. 4 | Exploitation-exploration state transitions occur in 
the absence of prey and are not modulated by starvation. a, Behavioural 
state transitions in two animals, each placed in an arena without Paramecia 

for over an hour. b, PCA trajectory based on whole-brain activity froma 
representative animal in the absence of prey, colour-coded by the activity of 
exploitation-state encoding neurons. c, Fraction of time in exploitation state in 
the absence (grey) and presence (purple) of Paramecia (25 animals in the 
presence of Paramecia and 7 animals in the absence of Paramecia). d, Box plots 
of state-dependent differences in movement types (n=7 animals, 13,997 


movement events). The number of capture and rejection events were both 0, 
because there was no prey. e, f, Comparison between starved and fed larvae. 

e, Fraction of time in exploitation state for the first 10 min (left), and the entire 
duration of the experiment (right). f, Probabilities by movement type for larvae 
that where fed (green) or starved for 24 h before the experiment (yellow) (n=5 
animals in fed condition, n=9 animals in 24 hstarved condition). c-f, Adjusted 
Pvalue (above each plot) obtained by two-tailed Wilcoxon signed-rank test. 
Adjusted P values have been corrected for multiple comparisons by Holm- 
Bonferroni method. 
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Extended Data Fig. 5 | Spatial Pvalue analysis, spatial distribution of 
neurons encoding exploitation state, exploration state, and transition from 
exploration to exploitation, and sensorimotor networks activated during 
hunting. a, The spatial Pvalue analysis followed a two-step procedure. Instep1, 
ANTs registration was performed to transform the live fluorescent brain 
volumes of 17 animals into a common reference space defined by Z-Brain. In 
step 2, a spatial Pvalue was calculated for each functionally classified neuronin 
each animal by examining its spatial colocalization across all 17 fish. For 
instance, to calculate the spatial Pvalue of a functionally classified neuronin 
fish i(shownas red star), we calculate the shortest distance drto any neuronin 
fishj that was classified into the same functional class to generate asame-class 
distance vector dr;,,...,dr;;-4,d1;;,..-,d; m in which m=17. We then calculated the 
shortest distance dn between the neuronin fishiand arandom size-matched 
population of neurons in fish/ obtained by randomly sampling all neuronsin 
fish/, with the number of neurons sampled equal to the number of neurons 

in the functional class in fishj. Therefore, arandomized distance vector 
dn;,,...,dN;;-1,dN; j,...,dN;, m Was Calculated. To facilitate comparison of these 
distance vectors, we reduce them to a single scalar value by calculating the 
mean of the lowest 90% of the distances within the same-class distance vector 
and random distance vector. We then repeated the random sampling process 
Ntimes (N=10,000 in our analysis) to construct a null distribution of the 
shortest distance betweena functionally classified neuron in fishiand the 
randomly sampled neurons in other animals. We then fit this null distribution 


with anormal distribution. The Pvalue for this functionally classified neuronin 
fishirelative to all other animals was then calculated by evaluating dr onthe 
basis of anormal distribution model. We reject the null hypothesis of random 
sampling if a functionally classified neuron in fishihas a spatial Pvalue 

< 0.025. b, The spatial density for each area in the Z-Brain map was calculated 
for neurons that encode exploitation (left), exploration (middle), and 
transition from exploration to exploitation (right). c-e, Brainregions with 
neural densities higher than two standard deviations (measured across all 
brain regions) for exploitation-state-encoding neurons (c), exploration-state- 
encoding neurons (d) and trigger neurons that encode the transition from 
exploration to exploitation (e) (n=17 animals; only neurons with consistent 
spatial localization and spatial Pvalue < 0.025 are considered for this analysis). 
f, Retinotopic map of prey detection neurons in the optic tectum. Left, 
anatomical position of neurons tuned to different prey angles (Methods). 
Right, angular tuning curves of optic tectum neurons classified by angle of 
peak prey response, in 30° increments. g, Location of all neurons time-locked 
to the onset of eye convergence. h, All neurons time-locked to successful prey 
ingestion. i, Retinotopic map for each prey detection angle depicted 
individually across eight maps.j, Motor networks that drive distinct movement 
types associated with pursuit and capture of prey. Motor networks were 
identified for allseven observed movement types. All panels depict neurons 
with spatially significant colocalization across animals (spatial Pvalue < 0.025, 
n=17 animals). Scale bars, 50 pm. 
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neuronal centroids identified from NMF (Methods) for each animal. The trajectories colour-coded by the activity of exploitation-state-encoding 


second column shows PCA trajectories in the first three PCA dimensions. neurons in each animal. 
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Extended Data Fig. 7| Correlation between state duration and activity of 
dorsal raphe state neurons and exploration-state-encoding neurons, 
multimodal activation of the exploitation state, and stochastic model of 
trigger and state signals. a, Top, peak activities of exploration state neurons 
are not correlated with the duration of exploration state. Bottom, peak 
activities of dorsal raphe exploitation state encoding neurons correlate with 
the duration of exploitation state. Activity is in units of z-scored fluorescence. 
b-d, The trigger network that is active at the transition from exploration to 
exploitation contains both modality-specific neurons (b, c), that are only active 
at either spontaneous transitions (b) or at stimulus-evoked transitions (c), as 
well as cross-modal neurons that are activated at all state transitions regardless 
of whether the transition is stimulus-evoked or spontaneous (d). e, Schematic 
of the stochastic model. Spontaneous and stimulus-evoked trigger events are 
represented by delta functions with variable (integrated) amplitudes. The 
spontaneous trigger events arrive according to a Poisson process during 
exploration state, triggering the exit from exploration state after an 
exponentially distributed waiting time. Each delta function causes the trigger 
signal to undergo an impulsive rise, followed by rapid exponential decay. 


The state signal integrates the trigger signal, and then undergoes a slow linear 
decay back to baseline. Exploitation state corresponds to the time intervals 
when thestate signal is elevated above baseline. Spontaneous trigger 
amplitudes are exponentially distributed, which leads toa corresponding 
amplitude in the state signal, and effectively determines the duration of the 
ensuing exploitation state due to the slow linear decay of the state signal. f, Two 
examples of recorded trigger and state signals during long exploitation states, 
together with corresponding signals in the fitted model. Event amplitudes 
Ajo" and times ¢7°°™ are fit individually for each transition. The shared model 
parameters (a, B, y) were obtained by simultaneous least squares fitting of the 
equations for 7(¢) and S(t) to the dynamics of the population mean activity of 
trigger and exploitation state neurons during 14 exploitation states having a 
duration of at least 10 min (obtained from n=17 fish). The joint fitting 
procedure yielded a=1.5, B=0.1min"™, and y=1.3 min”. These parameters, 
together with the fitted behavioural state distributions from Fig. 1d (obtained 
from n=36 fish), comprise the conceptual model of the stochastic nonlinear 
dynamical system for exploration and exploitation. 
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Extended Data Fig. 8 | Validation of exploitation-state-encoding dorsal 
raphe neurons. a, b, Exploitation-state-encoding neurons have stable 
correlation to state even after excluding activity around hunting events. a, To 
verify the hypothesis that spatially significant (spatial Pvalue < 0.025) 
exploitation-state-encoding neurons encode a persistent state and not simply 
hunting events, we removed the neural activity around all hunting events 
across the entire experiment, with increasing time intervals ranging from0.5s 
tol0sinsteps of 0.5s.b, State-encoding neurons remain stably correlated to 
state after removal of increasingly large time windows around hunting events. 
Orange, spatially significant (spatial Pvalue < 0.025) dorsal raphe exploitation- 
state-encoding neurons after spatial Pvalue significance test. Blue, all spatially 
significant (spatial P value < 0.025) exploitation-state-encoding neurons 
(n=17 animals). Standard error of all exploitation-state-encoding neurons is 

< 0.0024, and standard error of dorsal raphe exploitation-state-encoding 
neurons is < 0.0041. c-f, Validation of exploitation-state-encoding dorsal 
raphe neurons by clustering analysis. As acontrol test, we also conducted the 
same analysis on another brain region, the inferior olive (g-j).c, g, Clustering 


analysis of dorsal raphe neurons (c) and inferior olive neurons (control) (g). 
d,h, Activity of the tightest-correlated cluster (lowest average intra-cluster 
distance) in the dorsal raphe (d) and inferior olive (h) of each animal aligned to 
state transition from exploration to exploitation. e, i, Cross-validation of 
exploitation-state-encoding activity across animals for dorsal raphe (e) and 
inferior olive neurons (i). Each element of the array represents the mean 
correlation coefficient C; between the neural activity of the tightest-correlated 
dorsal raphe cluster of fishiand behaviour state of fish/. f, Statistical analysis 
between dorsal raphe neurons in on-diagonal elements (i=/) and neuronsin 
off-diagonal elements (i#/). The mean correlation coefficient Ci fori=jwas 
0.468, whereas the mean correlation coefficient G fori#jwas 0.024. A two- 
sample t-test analysis verified that there was a significant difference between 
these two groups (P<10°°, n=17 animals).j, The same statistical analysis 
showed there was no significant difference between neurons in on-diagonal 
elements and neurons in off-diagonal elements for the inferior olive (mean 
correlation coefficient os for i=j was -0.047, mean correlation coefficient ci 
for i#j was 0.023 (P=1, two-sample ¢-test). 
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a total of 17 animals was used. 
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Antibodies used Primary antibodies used to label dopamine neurons were mouse anti-tyrosine hydroxylase (Immunostar 22941). Primary 
antibodies used to label serotonin neurons were rabbit anti-SHT (serotonin) (Immunostar 20080). Primary antibodies used to 
label acetylcholine neurons were goat anti-Choline Acetyltransferase (Millipore AB144P). 

Secondary antibodies used were donkey anti-mouse Alexa647 (Jackson ImmunoResearch 715-606-151), donkey anti-rabbit 
Alexa594 (Jackson ImmunoResearch 711-586-152) and donkey anti-goat Alexa647 (Jackson ImmunoResearch 705-606-147). 


Validation We validated immunostaining patterns for dopamine (anti-tyrosine hydroxylase), serotonin (anti-SHT), and CHAT (anti-Choline 
Acetyltransferase) against previous publications: 
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Ryu S, Holzschuh J, Mahler J, Driever W. Genetic analysis of dopaminergic system development in zebrafish. J Neural Transm 
Suppl. 2006; 70:61-66. 

Filippi A, Mahler J, Schweitzer J, Driever W. Expression of the paralogous tyrosine hydroxylase encoding genes th1 and th2 
reveals the full complement of dopaminergic and noradrenergic neurons in zebrafish larval and juvenile brain. J Comp Neurol. 
2010; 518:423—438. [PubMed: 20017209 


Serotonin: 
Norton WH, Folchert A, Bally-Cuif L. Comparative analysis of serotonin receptor (HTR1A/HTR1B families) and transporter 
(slc6a4a/b) gene expression in the zebrafish brain. J Comp Neurol. 2008; 511:521-542. [PubMed: 18839395] 


Choline acetyltransferase (CHAT): 
Hong E, Santhakumar K, Akitake CA, Ahn SJ, Thisse C, Thisse B, Wyart C, Mangin JM, Halpern ME. Cholinergic left-right 
asymmetry in the habenulo-interpeduncular pathway. Proc Natl Acad Sci USA. 2013; 110:21171-21176. [PubMed: 24327734] 
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As a collective group, to get a better idea of the 3 dimensional expression patterns for all three neuromodulators, we further 
referenced: 

P. Panula, Y.C. Chen, M. Priyadarshini, H. Kudo, S. Semenova, M. Sundvik, V. Sallinen. The comparative neuroanatomy and 
neurochemistry of zebrafish CNS systems of relevance to human neuropsychiatric diseases. Neurobiology of Disease 2010 
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Laboratory animals zebrafish larvae (6-8 dpf). Sexual differentiation has not occurred by this age. 
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All viruses require strategies to inhibit or evade the immune pathways of cells that 
they infect. The viruses that infect bacteria, bacteriophages (phages), must avoid 
immune pathways that target nucleic acids, such as CRISPR-Cas and restriction- 
modification systems, to replicate efficiently’. Here we show that jumbo phage OKZ 
segregates its DNA from immunity nucleases of its host, Pseudomonas aeruginosa, by 
constructing a proteinaceous nucleus-like compartment. OKZ is resistant to many 
immunity mechanisms that target DNA in vivo, including two subtypes of CRISPR- 
Cas3, Cas9, Cas12a and the restriction enzymes HsdRMS and EcoRI. Cas proteins and 
restriction enzymes are unable to access the phage DNA throughout the infection, but 
engineering the relocalization of EcoRI inside the compartment enables targeting of 
the phage and protection of host cells. Moreover, MKZ is sensitive to Cas13a—a 
CRISPR-Cas enzyme that targets RNA—probably owing to phage mRNA localizing to 
the cytoplasm. Collectively, we propose that Pseudomonas jumbo phages evade a 
broad spectrum of DNA-targeting nucleases through the assembly of a protein barrier 


around their genome. 


Phages that infect Pseudomonas aeruginosa can avoid destruction 
mediated by CRISPR by encoding ‘anti-CRISPR’ proteins that inhibit 
the type I-E and I-F CRISPR-Cas systems? *. To determine whether any 
P. aeruginosa phages are resistant to the P. aeruginosa type I-C CRISPR- 
Cas system? (a common, but understudied variant®), we engineered a 
strain of P. aeruginosa to express type I-C cas3, cas5, cas8 and cas7’, 
and provided this strain with a panel of CRISPR RNAs (crRNAs) that 
target phages from five taxonomic groups:JBD30, D3, JBD68 (distinct 
temperate siphophages), F8 and MKZ (distinct lytic myophages). All 
phages succumbed to targeting (Fig. 1a), except for MKZ (Fig. 1b) and 
arelated phage, ®PA3 (Extended Data Fig. 1a). The MKZ titre did not 
decrease when OKZ was challenged with 11 different type I-C crRNAs 
(Fig. 1b, Extended Data Fig. 2a), nor when it was targeted by the type I-F 
CRISPR-Cas system of P. aeruginosa’ (Extended Data Fig. 2b). 


Phage MKZ resists CRISPR-Cas targeting 


The ®KZ genome has no homologues of known anti-CRISPR (Acr) 
genes” **"°, or anti-CRISPR-associated genes, which have previously 
enabled identification of Acr genes*”?°. Moreover, gene knockout 
approaches have not been established for KZ. Therefore, to deter- 
mine the mechanism by which this phage can evade CRISPR-mediated 
destruction we used type II-A CRISPR-Cas9 from Streptococcus pyo- 
genes (SpyCas9). SpyCas9 and single-guide RNAs (sgRNAs) robustly 
targeted control phage JBD30, but phage replication and the associated 
lysis of host cells were unaffected during phage ®KZ infection (Fig. 1c). 
An additional six sgRNA sequences were also unable to impact OKZ 


replication (Extended Data Fig. 3a), as were four sgRNAs against MPA3 
(Extended Data Fig. 1b). Given the ability of this phage to evade unre- 
lated CRISPR systems (types I and II)-including one froma microorgan- 
ism that this phage does not infect (S. pyogenes)—we hypothesized that 
@KZ might be generally resistant to CRISPR-Cas immunity, as opposed 
to relying on specific inhibitor proteins. When we expressed the type 
V-A Cas12a (Cpfl) CRISPR-Cas system from Moraxella bovoculi in 
P. aeruginosa, the control phage was targeted successfully, but DKZ was 
not, with any of the ninecrRNAs tested (Fig. 1d, Extended Data Fig. 3b). 
The ability of MKZ to resist both CRISPR systems that are found in its 
natural host, Pseudomonas (types I-C and I-F), and those that are not 
naturally present in Pseudomonas (types Il-A and V-A) suggests that this 
phage has a mechanism that enables ‘pan-CRISPR ’ resistance. 
Restriction-modification (R-M) systems are the most common form 
of bacterial immunity in nature and pose a substantial impediment 
to phage replication’. We next tested type I and type II R-M systems 
(HsdRMS from P. aeruginosa and EcoRI from Escherichia coli, respec- 
tively; 24 and 92 cut sites in the ®KZ genome, respectively). ®KZ was 
propagated on the P. aeruginosa strain PAK, an isolate that generates 
phages that are susceptible to HsdRMS restriction when infecting the 
PAOI strain. When phage JBD30 (which has five typeI R-M sites) was 
assayed in this manner, its titre was reduced by five orders of magni- 
tude, which was dependent on hsdR (Fig. le). By contrast, no restric- 
tion was observed for MKZ (Fig. le) or MPA3 (Extended Data Fig. 1c). 
Similarly, the expression of EcoRI reduced the titre of JBD30 (6 EcoRI 
sites) by three orders of magnitude, but had no effect on KZ (92 EcoRI 
sites) (Fig. 1f). Together, these experiments demonstrate that OKZ is 
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Fig. 1| Identification of a phage that resists targeting by diverse CRISPR-Cas 
and R-Msystems. Plaque assays with the indicated phage spotted in 

tenfold serial dilutions ona lawn of P. aeruginosa; dark clearings inthe lawn 
represent phage replication. a—d, Strain PAO] expressing type I-C Cas genes 
(cas3, cas5, cas8, cas7) and crRNAs that target the indicated phages (a); type I-C 
Cas genes and crRNAs that target phage JBD30 or distinct crRNAs (1-3) that 
target phage MKZ (b); typell-A cas9 and distinct sgRNAs that target the 
indicated phages (c); or type V-A casi12a and distinct crRNAs against the 


refractory to our six selected CRISPR-Cas (types I-C, I-F, I-A, and V-A) 
and restriction endonucleases (types | and II) in vivo. 


Protein barrier occludes immune enzymes 


®MKZ and ®KZ-like phages that infect P. aeruginosa and Pseu- 
domonas chlororaphis have been shown to construct an elaborate 
proteinaceous nucleus-like compartment in which the replication of 
phage DNA takes place”. In addition, a phage-encoded homologue 
of tubulin, PhuZ, centres the compartment within the host cell”. 
Proteins that are involved in DNA replication, transcription and recom- 
bination localize inside the shell, whereas MRNA and proteins that 
mediate translation localize inthe cytoplasmic space!—similarly to the 
eukaryotic nucleus. Given the apparent exclusion of specific proteins, 
we hypothesized that this structure might be responsible for the pan- 
resistance of MKZ to distinct immune processes. 

P. aeruginosa cells that were infected with ®KZ were imaged with 
immunofluorescence to detect Cas9 (Fig. 2a, Extended Data Fig. 4). 
Wealso used live-cell imaging to detect Cherry fusions with Cas9, two 
signature proteins from the type I-C and I-F CRISPR-Cas systems (Cas8 
and Cas3; Fig. 2b) and the restriction enzyme HsdR (Fig. 2c). These 
experiments showed that the immune enzymes are excluded from the 
shell during phage infection. DAPI staining revealed that the phage DNA 
was located inside the shell, whereas the host genome was degraded". 
The phage protein ORF152 (imaged with anti-Myc immunofluores- 
cence and Cherry fusion) and host protein topoisomerase I (Cherry 
fusion)—both of which have been previously shown to be internalized 
inthe shell—colocalized with the DAPI signal, but Cherry was excluded 
(Fig. 2d). Although the rules that govern protein internalization in 
the shell are currently unknown, each protein of known function that 
localizes inside the shell interacts with DNA”” (that is, forms part of 
the DNA replication and transcription machinery), suggesting that the 


indicated phages (d). e, An endogenous type! R-M system (hsdRSM) in wild- 
type PAO1anda mutant strain with an isogenic hsdR knockout (PAO1Ahsdr) 
was assayed using phages that were propagated on PAK (for example, JBD30 
(PAK) was first propagated on strain PAK). f, The type II EcoRI R-M system in 
PAO1. Restriction activity was assayed using phages JBD30 and OKZ. pEcoRI 
denotes plasmid-expressed EcoRI. All plaque assays were replicated two or 
more times with similar results. 
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Fig. 2| CRISPR-Cas proteins and restriction enzymes are excluded fromthe 
nucleus-like structure of MKZ. a, Fluorescence microscopy of P. aeruginosa, 
immunostained for Cas9. The DAPI stain shows the phage DNA within the 
nucleus-like structure. b, c, Live-cell fluorescence imaging of P. aeruginosa 
strains that were engineered to express type II-A Cas9, type I-C or I-F Cas8 or 
Cas3 proteins fused to Cherry (b) or HsdR fused to Cherry (c). Scale bar, 1m. 
d, Top row, fluorescence microscopy of Myc-tagged ORF152 in P. aeruginosa, 
imaged with anti-Myc immunofluorescence. Bottom rows, live-cell 
fluorescence imaging of ORF152 and TopA proteins fused to Cherry, or Cherry 
alone. All experiments were replicated two or more times with similar results. 
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exclusion of DNA-binding Cas and restriction proteins is an adaptive 
function of the shell. 

Toconfirm that the KZ phage genome could bea substrate for DNA 
cleavage, if accessed, two enzymes that did not cleave MKZ in vivo, 
EcoRI and Cas9, were assayed in vitro. DKZ DNA was extracted from 
virions and subjected to restriction digestion reactions with a panel 
of restriction enzymes. EcoRI, HindIII, KpnI and Ncol all cleaved the 
DNA, whereas SaclI—which lacks a sequence-recognition motif in the 
KZ genome—did not (Fig. 3a, Extended Data Fig. 5a). The ability of 
Cas9 to cleave OKZ genomic DNA (gDNA) in vitro was next assessed. 
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Fig.3|®KZgDNAcan be cleaved by immune enzymes. a, OKZ andJBD30 
gDNA digested with the indicated restriction enzymes in vitro. The first lane 
contains a1-kb DNA ladder. The number of cut sites for each enzyme is shown at 
the bottom of the gels. Products were visualized ona 0.7% agarose gel stained 
with the DNA stain SYBR Safe. b, OKZ gDNA digested in vitro using KasI, and 
incubated with or without Cas9 loaded with crRNA:tracrRNA that targets the 
fragment liberated by KasI. Products were visualized ona 0.7% agarose gel 
stained with SYBR Safe. c, Plaque assays for strain PAO] expressing the fusion 
proteins EcoRI-Cherry, EcoRI-Cherry-ORF152 or EcoRI(E111G)—Cherry- 
ORF152. Plaque assays were conducted as in Fig. 1laand quantified (n=3). Data 
are mean +s.d. A Student’s t-test comparing the efficiency of plaquing of ODKZ 
on PAOt1 that expresses pEcoRI-Cherry to PAOI that expresses pEcoRI-Cherry- 
ORF152 yielded a Pvalue of 2.88 x 10“ (indicated with an asterisk). d, Growth 
curves measuring the optical density at 600 nm (ODgoonm) Of PAOL cells that 
were infected with ®KZ at the indicated multiplicity of infection (MOI). 

e, Live-cell fluorescence imaging of infected strains of P. aeruginosa that were 
engineered to express EcoRI-Cherry, EcoRI(E111G)—Cherry, EcoRI-Cherry— 
ORF152 or EcoRI(E111G)—Cherry-ORF152. The DAPI stain shows the phage DNA. 
The in vitro digestion experiments ina, b were replicated twice with similar 
results. The plaque assays (c), growth curves (d) and microscopy (e) were 
replicated three times. Scale bars, 111m. 


Owing tothe large size of the DKZ genome (280 kb), we first subjected 
purified phage DNA to the restriction enzyme Kas! to yield a 6.98-kb 
product, and then cleaved that species with a SpyCas9 nuclease loaded 
with both crRNA and trans-activating crRNA (tracrRNA) in vitro. This 
reaction depleted the substrate and produced the expected 4.9-kb and 
2.1-kb fragments, confirming that the phage gDNA is sensitive to Cas9 
cleavage (Fig. 3b). Together, these results demonstrate that immune 
enzymes are capable of cleaving ®KZ gDNA when they can access it, 
and that immune evasion is probably nota result of an intrinsic feature 
of the phage DNA (such as base modifications that can impede the 
Cascade-Cas3 complex, Cas9 and EcoRI)*”. 


Phage targeting via enzyme localization 


As there is no currently known method that can be used to mutate, 
weaken or knockout the phage shell structure, we next sought to enable 
an immune enzyme to bypass the shell and thereby access the phage 
DNA in vivo. We fused the single effector enzyme Cas9 to ORF152, a 
phage-encoded RecA-like protein that is internalized within the shell”. 
Independent fusions of Cas9 to the N or the C terminus of ORF152 did 
not affect the replication of OKZ (Extended Data Fig. 6a, b). Imaging 
of one orientation of ORF152-Cas9 revealed peri-shell localization 
(Extended Data Fig. 6c, d); this suggests that the fusion redirected 
Cas9 from its previously diffuse state but that the large Cas9 protein 
(1,368 amino acids, 158 kDa) was unable to traverse the shell border. 
Given the large size (for example, Cas12a) and complexity (for exam- 
ple, Cascade, HsdRMS) of each of the other immune effectors, we next 
fused the small, single effector protein EcoRI (278 amino acids, 31.5 kDa) 
to Cherry-tagged ORF152. This fusion resulted in a notable decrease 
of more than 2.5 orders of magnitude in the ®KZ titre and markedly 
reduced plaque sizes (Fig. 3c. Liquid infections revealed that cells that 
express EcoRI-Cherry-ORF152 were extremely well protected, gain- 
ing an increase of more than five orders of magnitude in their resist- 
ance to phage-induced lysis (Fig. 3d). A catalytic mutant EcoRI(E111G) 
fused to Cherry-ORF152 exhibited no immune activity against DKZ 
in either assay, and nor did active EcoRI-Cherry without the ORF152 
fusion (Fig. 3c, d). Similar results were observed in the absence of the 
Cherry tag (that is, EcoRI-ORF152); however, MKZ targeting was more 
modest in that case, reducing titres by approximately tenfold and pro- 
tecting cells by a factor of around 10* (Extended Data Fig. 7). Imaging 
of infected cells that express EcoRI-Cherry or EcoRI(E111G)-Cherry 
demonstrated that these proteins are excluded from the shell, whereas 
EcoRI(E111G)—Cherry—ORF152 is successfully localized inside (Fig. 3e; 
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Fig. 4| Phage ®KZ DNA is sensitive to RNA targeting with Cas13. a, Strain 
PAO1 expressing LseCas13a and distinct crRNAs that target the shell (one 
crRNA) or the head (two crRNAs) of phage OKZ. Plaque assays were conducted 
as in Fig. 1a. b, Growth curves measuring the ODgoonm Of PAOI cells that were 
infected with KZ at the indicated MOI.c, Wild-type (WT) and Cas13 escaper 
KZ phage, with deletions indicated with red dashed lines and target sites with 
blue lines. d, Live-cell fluorescence imaging of P. aeruginosa strains that were 


engineered to express LseCas13a and crRNAs that target OKZ. The cyan stain 
shows the DAPI-stained phage DNA. e, Summary model. The ®KZ nucleus-like 
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see Supplementary Video 1 for time-lapse images). EcoRI-Cherry- 
ORF152 impaired the ability of infected cells to form full shells and 
proceed through the infection process, and the EcoRI fusion protein was 
often localized within or adjacent to the DAPI-stained puncta (Fig. 3e, 
Supplementary Video 2). Some host DNA can be seen in these cells 
and infection does not proceed (Fig. 3e, Extended Data Fig. 7c, d). By 
bypassing the physical barrier of the shell using rational engineering, 
we conclude that the shell is the cause of the resistance to immunity 
that is shown by jumbo phage OKZ. 


Phage mRNA is sensitive to Cas13a 


The nucleus-like structure produced by ®KZ provides robust resist- 
ance to DNA-targeting immune pathways, but other immune systems 
may exist in vivo that can evade this mechanism. We envisaged that 
the mRNA that is exported for translation in the cytoplasm is probably 
susceptible to targeting and that a ribonuclease could provide anti-OKZ 
immunity. To test this, we adapted the type VI-A CRISPR RNA-guided 
RNA nuclease Cas13a from Listeria seeligeri (LseCas13a) *°” for phage 
targeting in P. aeruginosa. Three LseCas13a spacers (two targeting the 
head gene gp146 and one targeting the shell gene gp054) decreased the 
efficiency of OKZ plaquing by more than 10° fold (Fig. 4a, Extended 
Data Fig. 8). Corroborating these results, LseCas13a was also effective 
at protecting P. aeruginosa in liquid cultures, with an increase of 1-2 
orders of magnitude in resistance to phage-induced lysis (Fig. 4b). We 
suspect that the previously demonstrated collateral damage caused by 
Cas13a” is detrimental to cell fitness, which leads to an enhancement 
of the growth curves that is modest compared to that brought about 
by EcoRI targeting. Cas13 escaper phages (that is, phages that evaded 
Cas13-mediated targeting) were also identified (Extended Data Fig. 9); 
these occurred at a frequency of around 10° to 10” and, surprisingly, 
contained genomic deletions at the site of targeting (Fig. 4c). Notably, 
all deletions were in-frame and five out of six escaper phages had clear 
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structure excludes Cas9, Cas12, Cascade-Cas3 (typel-C andtypel-F), typel 
restriction endonucleases (REase) and type Il restriction endonucleases (for 
example, EcoRI); however, the mRNA (red) is exported and can be targeted by 
Cas13. The nucleus-like structure is resistant to the indicated nucleases, but 
fusing EcoRI to the internal protein ORF152 enables targeting. The assays in 
awere replicated more than three times with similar results. The growth curve 
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(c) were isolated once and verified by PCR, sequencing and plaque assays. 
Microscopy (d) was replicated twice with similar results. 


EcoRI EcoRI 
fusion 


VAVON a 
Phage DNA 


regions of microhomology flanking the deletion that ranged from 
7-21bp in length. One guide RNA targeting the 5’ end of orf146 selected 
for deletions in which the upstream gene (orf145) became fused, in 
frame, to the targeted gene (Fig. 4c). These data support the conclu- 
sion that sequence-specific RNA targeting by CRISPR-Cas proteins 
can inhibit this phage. 

Before the construction of the shell takes place, the state of the phage 
DNA is unknown. During Cas13a targeting of the shell mRNA, imaging 
revealed that infections arrested before the phage DNA proceeded from 
its injection site at the poles. Most cells had two polar puncta, even one 
hour after infection (Fig. 4d). The absence of diffusion or clearance of 
the phage DNA (for example, by the endogenous typeI R-M system) 
suggests that the injected phage genome may be protected before 
shell assembly by injected phage proteins or pre-existing host factors. 
The potential for a DNA structure to exist prior to shell constructionis 
further corroborated by our observation that in the absence of Cas13a- 
mediated arrest, DAPI-stained phage DNA was detected adjacent to 
EcoRI-Cherry—ORF152 puncta (Fig. 3e) or adjacent to a nascent shell 
in which TopA was localized (Extended Data Fig. 10). The molecular 
nature and mechanisms underlying this early protection event remain 
to be elucidated. 


Discussion 


The assembly of a proteinaceous compartment in which phage DNA 
replication occurs creates a physical protective barrier that results 
in the resistance of phage ®KZ to enzymes that cleave DNA (Fig. 4e). 
The evasion of endogenous P. aeruginosa type | CRISPR-Cas systems 
by MKZ suggests that these jumbo phages are probably resistant to 
type 1 CRISPR-Cas systems in nature. Supporting this hypothesis, 
our analysis of a previously published collection of over 4,000 non- 
redundant P. aeruginosa spacers’ (types|I-C, I-Eand I-F) found no spacers 
against KZ or its jumbo phage relatives ®PA3, PaBG, KTN4 and PA7 


Nature | Vol577 | 9 January 2020 | 247 


Article 


Table 1| No spacers against DKZ-like phages 


Phage No.ofspacers TypelCRISPRsensitivity Reference 
DMS3m 75 I-C: sensitive; I-E: resistant This study and 
(AcrlE3); I-F: sensitive previous studies*® 
JBD30 51 I-C: sensitive; I-E: sensitive; This study and 
|-F: resistant (AcrIF1) previous studies”* 
JBD18 51 |-F: sensitive A previous study® 
JBD25 46 |-F: sensitive A previous study® 
JBD68 28 I-C: sensitive This study 
D3 49 I-C: sensitive This study 
F8 3 I-C: sensitive This study 
OKZ 0) I-C: resistant; I-F: resistant This study 
phiPA3 (0) I-C: resistant This study 
PaBG (0) not assayed 
KTN4 (0) not assayed 
PA7 (0) not assayed 


The total number of type | CRISPR spacers with a perfect match to the indicated P. aeruginosa 
phages assayed in this study and previous CRISPR-Cas studies. The experimental sensitivity 
of each phage to the indicated subtypes is shown. AcrlE3 and AcrIF1 are I-E and I-F anti- 
CRISPR proteins, respectively. For F8, the 3 spacers have mismatches (4 nucleotides or fewer) 
to the F8 genome. 


(Table 1). By contrast, many spacers from each system match diverse 
P. aeruginosa phages, such as those assayed in our screen (Fig. 1) and 
those encoding anti-CRISPR proteins. In addition, given the efficacy 
of the RNA-targeting CRISPR-Cas13 system, we propose that type VI 
CRISPR systems are well suited to target the mRNA of DNA phages 
when the DNA is inaccessible (that is, owing to base modifications or 
physical segregation). 

Although this phage compartment has only been documented 
among the jumbo phages of Pseudomonas", physical occlusion of 
phage gDNA through this and other mechanisms may constitute a route 
through whichimmune systems can be evaded by other phages. Indeed, 
‘mega phages’ have recently been reported to encode homologues 
of phage tubulin, which in Pseudomonas are involved in centring the 
proteinaceous compartment during infection”®. The pan-resistance 
of ®KZ to enzymes that target DNA provides an explanation for the 
elaborate, nucleus-like structure. Furthermore, the polar localization 
of the injected phage DNA during mRNA targeting suggests a poorly 
understood early protective mechanism. Considering the pronounced 
resistance to overexpressed immune enzymes and the previously 
observed docking of capsids at the shell periphery"*, we propose 
that the phage DNA is never exposed to the cytoplasm (Fig. 4e). Other 
hypotheses to explain the importance of the shell—including protection 
from phage-derived nucleases that degrade the bacterial genome, and 
spatial organization of the large phage genome during replication and 
packaging—remain to be addressed. Regardless, we conclude that the 
phage-assembled nucleus-like structure provides a strong protective 
barrier to DNA-targeting immune pathways. 
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Methods 


No statistical methods were used to predetermine sample size. The 
experiments were not randomized, and investigators were not blinded 
to allocation during experiments and outcome assessment. 


Bacterial growth and genetic manipulation 

The strains, plasmids, phages and spacer sequences used in this study 
are listed in the Supplementary Information. P. aeruginosa strain PAO1 
was grown in LB at 37 °C with aeration at 225 rpm. When necessary, 
plating was performed on LB agar with carbenicillin (250 pg ml“) or 
gentamicin (50 pg ml”). Gene expression was induced by the addition of 
L-arabinose (0.1% final) and/or isopropyl B-D-1-thiogalactopyranoside 
(IPTG; 0.5 mM or 1mM final). For chromosomal insertions at the attTn7 
locus, P. aeruginosa cells were electroporated with the integrating vec- 
tor pUC18T-lac and the transposase expressing helper plasmid pTNS3, 
and selected on gentamicin. Potential integrants were screened by 
colony PCR with primers PTn7R and PglmS-down. Electrocompetent 
cell preparations, transformations, integrations, selections, plasmid 
curing and FLP-recombinase-mediated marker excision with pFLP were 
performed as described previously”. 


Phage growth and DNA extraction 

Phage growth was conducted in LB at 37 °C with PAO1as a host. Growth 
curves were conducted in a Biotek Synergy plate reader at 37 °C with 
orbital shaking set to maximum speed. Phage stocks were diluted, 
stored in SM buffer’ and used for routine plaquing assays. Plaque 
assays were conducted at 37 °C with 20 ml of bottom agar containing 
10 mM MgSO, and 0.35% or 0.7% top agar (often both concentrations 
were used in parallel) also containing 10 mM MgSO, and any inducer 
molecules. Spots of 3 pI were applied to the top agar after it had been 
poured and solidified. For high-titre lysates to generate phage DNA, 
plates with a high number of plaques were flooded with SM buffer 
and collected®. The lysates were subsequently treated with DNase. 
Phage DNA was extracted with the Wizard Genomic DNA Purification 
Kit (Promega). DNA restriction assays were performed according to 
standard NEB protocols and restriction fragments were assessed by 
agarose gel electrophoresis. 


Type I-C CRISPR-Cas system assay and expression in 

P. aeruginosa PAO1 

Type I-C CRISPR-Cas function was tested by electroporating a strain 
that naturally contains I-C Cas genes (strain F11) with pHERD30T 
plasmids encoding crRNAs that target phages. To express this sys- 
tem heterologously in PAOI, the four effector Cas genes (cas3, cas5, 
cas8 and cas7) were cloned into pUCI8T-lac and inserted in the PAO1 
chromosome as described above. After removal of the gentamicin 
marker, this strain was electroporated with the same pHERD30T 
crRNA-encoding plasmids to confirm function after induction with 
IPTG and arabinose. 


Type I-F CRISPR-Cas system expression in P. aeruginosa PAO1 

To express the type I-F system heterologously in PAOI, all I-F Cas 
genes (cas1,cas3, csy1, csy2, csy3 and csy4) were cloned into PMMBHE 
plasmids and transformed into PAO1. Subsequently, this strain was 
electroporated with the pHERD30T crRNA-encoding plasmids to con- 
firm function after induction with IPTG and arabinose. To maintain 
PHERD30T and pMMBHE in the same strain of P. aeruginosa, double 
selection of 30 pg ml“ gentamicin and 100 pg mI carbenicillin was 
used. 


crRNA cloning and expression 

All crRNAs used here were cloned into established entry vectors in 
the pHERD30T background. After removing a pre-existing Bsal cut 
site in the vector by mutagenesis, a pseudo-CRISPR array (that is, 


repeat-spacer-repeat for types I, Vor VI, or ansgRNA scaffold for type 
II) in which the spacing sequence possessed two inverted Bsal digest 
sites was then cloned into the vector, to facilitate scarless cloning of 
new spacers. Desired spacer sequences were chosen randomly across 
the phage genome (with the correct PAM sequence for the cognate 
CRISPR-Cas system) and ordered as two complementary oligonucleo- 
tides that generate sticky ends when annealed, to be cloned into the 
entry vector, which was digested with Bsal. Spacer oligonucleotides 
were treated with polynucleotide kinase (PNK), annealed and ligated 
into the entry vector. 


SpyCas9 and sgRNA expression in P. aeruginosa 

The S. pyogenes cas9 gene was cloned into a pUCI18T-Ara integration 
vector and then inserted into the attTn7 locus of PAO1. AnsgRNA scaf- 
fold was constructed based ona previous design” with internal Bsal 
cut sites to enable insertion of pre-annealed oligos for scarless sgRNA 
design. This sgRNA scaffold was amplified with primers p30T-gRNA_ 
Bsal and p30T-gRNA Bsal_rev. The resulting product was inserted into 
the pHERD30T vector by Gibson assembly after backbone (pJW1) ampli- 
fication by inverse PCR with primers gRNA Bsal-p30T and gRNA Bsal- 
p30T-rev. The sgRNA scaffold was positioned into pJW1 so that after 
Bsal cleavage the spacer insert +1 position would conincide with the 
pBAD transcription start site (TSS) +1 position. The resulting plasmid, 
pJB1, was digested with Bsal (NEB), followed by ligation of indicated pre- 
annealed oligonucleotides. Supplementary Table 3 contains a complete 
list of all target sequences. The sequence of the sgRNA construct with 
Bsal site locations is shown in Supplementary Table 3. 


Cas9 in vitro cleavage 

Cas9-based cleavage of the phage genomein vitro was conducted with 
purified Cas9 protein (NEB, M0386S), and the cleavage reaction based 
on Cas9-gRNA-tracrRNA was then performed according to the manu- 
facturer’s instructions (NEB). Cas9 crRNAs (Supplementary Table 3) 
were ordered as Alt-R CRISPR-Cas9 crRNAs from IDT and used with- 
out further modification. The tracrRNA was amplified using primers 
tracrRNA-FOR and tracrRNA-REV froma plasmid (pBR62). The tracrRNA 
was produced througha T7 RNAP reaction using dsDNA that encodes 
the tracrRNA downstream ofa T7 RNAP promoter. Cas9 protein (NEB) 
was combined with pre-annealed crRNA and tracRNA complex ata 1:1 
molar ratio. For targeting Kasl-liberated OKZ gDNA, 500 ng of OKZ DNA 
was co-incubated with Cas9 ribonucleoprotein (RNP), which cleaves at 
position 183,270, and Kasl in NEB buffer 3.1 for 1h. After stopping the 
reaction by proteinase K treatment, products were assessed by agarose 
gel electrophoresis. For Cas9 digestion of whole DNA, the reaction 
was performed at 37 °C for 4h with 300 ng of ®KZ or DMS3 gDNA and 
the products were assessed by agarose gel electrophoresis. Two Cas9 
guides were selected that cleave at positions 158,649 and 168,715 of the 
KZ genome to liberate a fragment of approximately 10 kb. 


Cas12a and crRNA design for expression in P. aeruginosa 

The humanized allele of the cpf1 gene of M. bovoculli (MBO_03467, 
KDN25524.1) was sub-cloned from pTE4495 (Addgene) into pUCI8T- 
lac using primers pUC_cpfl_F and pUC_cpf1 _R and inserted in the 
PAO1chromosomeas described above. A Cpfl repeat-spacer-repeat 
pseudo-CRISPR array was synthesized as oligonucleotides, annealed, 
ligated into a pHERD30T vector and digested with Ncol and Hindlll. 
Spacer sequences were cloned into the resulting vector (pJB2) after 
digestion with Bsal and ligation of pre-annealed spacer oligonucleo- 
tide pairs. 


Cas13a and crRNA design for expression in P. aeruginosa 

The wild-type allele of the cas13 gene of L. seeligeri and Leptotrichia 
shahiiwas sub-cloned from p2CT-His-MBP-Lse_C2c2_WT and p2CT-His- 
MBP-Lsh_C2c2_WT (Addgene) into pUCI8T-lac. LseCas13 and LshCas13 
were inserted in the PAOI chromosomeas described above. An/se- and 
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anLshCas13a repeat-spacer-repeat pseudo-CRISPR array were synthe- 
sized as oligonucleotides, annealed, ligated into a pPHERD30T vector 
and digested with Ncol and EcoRI. Spacer sequences were cloned into 
the resulting vectors (pSDMO057 and pSDMO70, respectively) after Bsal 
digestion and ligation of pre-annealed spacer oligonucleotide pairs. 
crRNA expression vectors were introduced into PAOI tn7::cas13' and 
PAO1 tn7::cas13"*". The resulting strains were assayed for phage sen- 
sitivity under standard phage plating conditions, with induction of 
both Cas13 and the gRNAs (50 pg mI gentamicin, 0.1% (L)-arabinose, 
1mMIPTG). 


EcoRI expression in P. aeruginosa 

The wild-type alleles of the M-EcoRI (methyltransferase) and R-EcoRI 
(endonuclease) genes were sub-cloned from pSB1A3 EcoRI Methylase- 
AmilCP and pSB1A3 EcoRI-RTX with EcoRI Methylase-AmilCP (Addgene 
plasmid 85166 and 85165), p2CT-His-MBP-Lse_C2c2_WT and p2CT-His- 
MBP-Lsh_C2c2_WT (Addgene) into pHERD30T using Gibson assembly. 
The resulting plasmids, pSDM160 and pSDMI161, were electroporated 
into PAOI AhsdR (SDMO20). The resulting strains were assayed for 
phage sensitivity under standard phage plating conditions, with induc- 
tion of both M-EcoRI (for genome protection) and R-EcoRI (50 pg mI 
gentamicin, 0.1% (L)-arabinose). 


Restriction-modification assay 

PAOI, PAK and PAO] AhsdR were grown to saturation in LB at 37 °C. 
Then, 4 ml of 0.7% molten top agar with 10 mM MgSO, were seeded 
with 100 pl saturated culture and spread on 20 ml 10 mM MgSO, LB 
agar plates. Tenfold serial dilutions of bacteriophage JBD30 and ®KZ 
propagated on strain PAO] and PAK were spotted (2.5 pl) on plates. 
Plates were incubated at 37 °C overnight and imaged the following day. 


Chromosomal knockout of hsdR 

To delete hsdR (NP_251422.1) from the PAO1 genome, a PAO1 strain that 
expresses Cas9 from S. pyogenes was programmed to express an sgRNA 
against hsdR: GCCCTCATCGAAGAAACCAG. In addition, the pHERD30T 
plasmid expressing this sgRNA was engineered to carry a repair tem- 
plate. This repair template consisted of the 500 bp upstream of hsdR 
and the 500 bp downstream of hsdR directly enjoined to one another. 
Induction of Cas9 and sgRNA led to cellular toxicity, as measured by 
ODgoo0nm (data not shown). Survivors of Cas9-mediated targeting were 
isolated. The hsdR locus of survivors was then amplified by colony 
PCR with primers binding outside of the region encompassed by the 
repair template. Products were resolved by gel electrophoresis in a1.0% 
agarose TAE gel at 100 V and visualized using the SYBR Safe DNA stain. 
Amplicons of a reduced size were then sequenced, which confirmed 
the chromosomal deletion of hsdR. A clone with the correct deletion, 
SDMO20, was chosen for downstream experiments. 


Construction of fusion proteins 

Plasmids expressing Cherry alone, Cherry-Cas3, or Cherry-Cas8 (of 
type I-C and type I-F systems) and Cherry-TopA were constructed by 
Gibson assembly inthe pHERD30T plasmid digested with Sacl and Pstl. 
These fusions have aGGAGGCGGTGGAGCC nucleotide (G-G-G-G-A 
amino acid) linker sequence in between them. cherry was amplified 
from SF-pSFFV-sfCherryFLIM3_TagBFP (provided by the laboratory of 
B. Huang, UCSF). cas3 and cas8 of the type I-C and type I-F systems were 
amplified from LL77 and PA14, respectively. topA was amplified from 
gDNA of PAO1. All of the primers that were used for the construction 
of the plasmids are listed in Supplementary Table 3. 

Plasmids expressing Cherry—Cas9 (pESN28) (primers: prESN74, 
prESN75, prESN76, prESN77) and Cherry—Cas9-ORF152 (pESN29) 
(prESN80O, prESN81, prESN82, prESN83) were constructed by Gib- 
son assembly in the pHERD30T plasmid. Cherry-ORF152 (pESN32) 
(prESN91, prESN92) was constructed by PCR amplification to omit 
Cas9 and ligated using aKLD reaction. An SGAAAAGGSQK amino acid 


linker connects Cherry to the other proteins. AGGGGS amino acid linker 
connects Cas9 and ORF152. The plasmid expressing cMyc-ORF152 
(prESN153, prESN154) was constructed by PCR amplification (primers: 
prESN153 and prESN154) using the p30-Cas9-ORF152 plasmid as the 
template, followed by a PCR clean-up and KLD ligation to circularize. 
This removed Cas9 from the plasmid, and added cMyc. All plasmids 
were designed using SnapGene. 

To construct translational fusions of proteins, desired gene frag- 
ments including cas9, M-EcoRI, R-EcoRI, orf152 (NP_803718.1) and 
sfCherry2 were amplified by PCR from templates pHERD30T::Cas9, 
pSB1A3::M.EcoRI-AmilCP, pSB1A3::M.EcoRI-AmilCP;R.EcoRI-RTX, DKZ 
gDNA and pHERD30T::sfCherry2, respectively. PHERD30T was lin- 
earized by restriction digestion with Sacl-HF and PstI-HF or by PCR. PCR 
primers included 15-30-bp overhangs for Gibson assembly. Overhangs 
that enjoin genes included amino acid linkers GGGGS or GGSGGS. PCRs 
were treated with DpnI to eliminate template DNA. All PCRs and restric- 
tion digests were purified using a PCR clean-up kit (Zymo Research DNA 
Clean and Concentrator Kits). Linearized vector and gene fragments 
were treated with Gibson assembly, and Gibson assembly reactions were 
transformed into competent £. coli. Plasmid products were isolated by 
minipreparation and submitted for sequencing. Correct assemblies 
were electroporated into appropriate P. aeruginosa strains. 


Isolation of Cas13 escapers 

For identifying escapers of Cas13a RNA targeting, 3 pl of high-con- 
centration MKZ lysates was mixed with 150 pl of overnight cultures of 
SDMO078, SDM109 or SDM107 (strains expressing Cas13a and a gRNA 
against KZ). After incubating at 37 °C for 10 min, samples were mixed 
with 4 ml of 0.7% agar, 10 mM MgSO,, 1 mM IPTG and 0.1% arabinose and 
plated in LB agar plates with gentamicin (50 pg ml”). After overnight 
incubation, plates were examined for the presence of escaper plaques. 
Escapers were formed in SDMO78 and SDM109, but notin SDM107. Ten 
phages that escaped targeting from SDMO78 and SDM109 were purified 
and the protospacer locus was amplified using PCR and subsequently 
sequenced. Six unique outcomes were identified and shown in Fig. 4. 


Growth curve experiments 

Growth curve experiments were carried out ina Synergy H1 microplate 
reader (BioTek, with GenS software). Cells were diluted 1:100 froma 
saturated overnight culture with 10 mM MgSO, and antibiotics and 
inducers, as appropriate. Diluted culture (140 pl) was added together 
with 10 pl of phage to wells in a 96-well plate. This plate was cultured 
with maximum double orbital rotation at 37 °C for 24 h with ODgoonm 
measurements every 5 minutes. 


Immunofluorescence 

Sample growth. Overnight cultures (5 ml) of astrain expressing Cas9 
and ansgRNA targeting ®KZ (SDM065), and astrain expressing cMyc- 
ORF152 (bESN27), were grown at 30 °Cin LB medium with gentamicin. 
A1:30 back dilution of the overnight culture into LB was grown at 30 °C 
for 1h. Protein and guide expression was induced with 0.1% arabinose 
and 0.5 mM IPTG, respectively. After 1 h of expression, an aliquot of 
uninfected cells was fixed and the remaining cultures were infected 
with OKZ using a MOI of 1.5. Infected cell aliquots were collected and 
fixed at 60 min post-infection. 


Fixation. This protocol was adapted froma previous study”’. Samples 
were fixed with 5x fixing solution (12.5% paraformaldehyde, 150 mM 
KPO, pH 7.2) and incubated for 15 min at room temperature, followed 
by 20 min on ice. Samples were then washed in PBS three times and 
finally resuspended in GTE (50 mM glucose, 10 mM EDTA pH 8.0, 20 mM 
Tris-HCl pH 7.65) with 10 ppg ml“ lysozyme. Resuspended cells were 
transferred to polylysinated coverslips and dried. Once dry, the cov- 
erslips were incubated in cold methanol for 5 min and cold acetone 
for 5 min. Cells were rehydrated by a rinse in PBS, followed by a 3-min 


incubation in PBS with 2% BSA blocking solution. Cells were incubated 
with a1:50 dilution of primary antibody (Cas9 (7A9-3A3): sc-517386 or 
cMyc (9E10): sc-40) in PBS with 2% BSA for 1h, followed by 3 washes of 
7 min each in fresh PBS with 2% BSA. Coverslips were then incubated in 
the dark for 1h with secondary antibody (goat anti-mouse Alexa Fluor 
555, Life Technologies A-21424) diluted 1:500 in PBS with 2% BSA. DAPI 
was added for the final 10 min of the incubation. Cells were washed 
in PBS 3 times for 7 min. Coverslips were then placed on slides using 
mounting medium (v/v 90% glycerol, v/v 10%Tris pH 8.0 and w/v 0.5% 
propyl-gallate) and sealed with clear nail polish. 


Microscopy and analysis. Images were collected using a Zeiss Axiovert 
200M microscope, and were processed using NIS Analysis software at 
the UCSF Nikon Imaging Center. Compartments and cells were manu- 
ally selected using the Simple ROI Editor. Background subtractions 
were conducted for each cell and compartment separately. The cor- 
rected intensities for the cytoplasmic and compartment regions were 
averaged over the area of each cell and compartment using Matlab and 
plotted using GraphPad Prism 6. Data from two pooled replicates were 
fitted with a line, with 95% confidence intervals shown as dashed lines. 
The slope is as reported in the plots. 


Live-cell imaging 

For live-cell imaging of OKZ infection, freshly grown cells from LB plates 
were picked and resuspended in 100 ul of LB medium. Around 5-10 
pl of the samples was spotted on 0.85% agarose pads with 1:5 diluted 
LB, arabinose (0.01-0.05%) and DAPI (2 pg mI). Samples were then 
incubated ina humidified chamber and allowed to grow for 3 hat 30 °C. 
KZ lysate (5 pl)was spotted on top of the agar pad and the samples 
were grown for an additional 1h. Cells were visualized in the microscope 
after being covered witha cover slip. A Nikon Ti2-E inverted microscope 
equipped with the Perfect Focus System (PFS) and a Photometrics 
Prime 95B 25-mm camera were used for live-cell imaging. Images were 
processed using NIS Elements AR software. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


All data generated or analysed during this study are included in this 
published article and its Supplementary Information files. 
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that target the indicated phages. c, The endogenous typeI R-Msystem 
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Extended Data Fig. 2 | Phage @KZ resists P. aeruginosa typeI-C and type I-F 
CRISPR-Cas immunity. a, Strain PAOI was engineered to express the typeI-C 
Cas genes and distinct crRNAs that target phage JBD30 and phage OKZ.b, 
Strain PAOI was engineered to express the type I-F Cas genes and distinct 
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crRNAs that target phage JBD30 and phage KZ. All plaque assays were 
conducted as in Fig. laand were replicated two or more times with similar 
results. 
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Extended Data Fig. 3| Phage @KZ resists targeting by heterologoustypeII-A 
and V-A CRISPR-Cas systems. a, Strain PAOI was engineered to express the 
typell-A Cas9 protein and distinct sgRNAs that target the indicated phages. 

b, Strain PAOI was engineered to express the type V-A Cas12a protein and 


distinct crRNAs against the indicated phages. All plaque assays were 
conducted as in Fig. 1a and were replicated two or more times with similar 
results. 
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Extended Data Fig. 4 | Quantification of Cas9 localization during phageMKZ _ _ (individual cells) fromtwo pooled replicate experiments are fitted with aline 
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Products were visualized ona 0.7% agarose gel stained with SYBR Safe. The 
and subjected to digestion with the indicated restriction enzymes in vitro. The experiment was replicated twice with similar result. 
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Extended Data Fig. 6 | Cas9 protein fused to ORF152 localizes tothe 
periphery of the shell, but does not enable immune activity against OKZ. 
a, Strain PAOI was engineered to express Cas9 fused to ORF152 at either the N 
or the C terminus, with sgRNAs targeting the indicated phages. b, Growth 
curves measuring the OD¢oonm Of PAOI cells that were infected with OKZ at the 
indicated MOI.c, Fluorescence microscopy of PAO1 fusion strains, 
immunostained for Cas9, in cells that express ORF152-Cas9 or Cas9-ORF152. 
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d, Fluorescence microscopy of P. aeruginosa, immunostained for Cas9, in cells 
that express ORF152-Cas9. The DAPI staininc, d shows the phage DNA within 
the shell. All plaque assays were conducted as in Fig. 1a and were replicated four 
times with similar results. The growth curve experiments (b) were replicated 
three times with similar results. Microscopy (c, d) was replicated twice with 
similar results. 
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Extended Data Fig. 7 | Fusion of EcoRI restriction enzymes to ORF152 
enables immune activity. a, Strain PAO] was engineered to express EcoRI or an 
EcoRI-ORF152 fusion protein. Plaque assays were conducted as in Fig. 1aand 
quantified (n=3).b, Growth curves measuring the OD goo nm Of PAOI cells that 
were infected with ®KZ at the indicated MOI.c, d, Live-cell fluorescence 
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imaging (with a wide field of view in d) of P. aeruginosa strains that were 
engineered to express EcoRI or EcoRI-ORF152. The DAPI stain shows the phage 
DNA. AII plaque assays were replicated three times with similar results. The 
growth curve experiments (b) were replicated twice with similar results. 
Microscopy (c, d) was replicated twice with similar results. Scale bar, 5 um. 
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Extended Data Fig. 8 | Phage KZ is sensitive to RNA targeting with Cas13. 
a, Strain PAOI] expressing LseCas13a and distinct crRNAs that target the 
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target the indicated genes. All plaque assays were conducted as in Fig. 1a and 
were replicated twice with similar results. 
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Extended Data Fig. 9 | Phage ®KZ escaper mutants are selected by Cas13- 
mediated RNA targeting. Strain PAO] expressing LseCas13a and acrRNA that 
targets the indicated gene. Plaque assays were conducted as in Fig. la using 
wild-type and escaper-mutant KZ. Wild-type ®KZ is targeted by both strains 
and the faint clearings observed here are not observed as plaques ina full-plate 
assay. All plaque assays were replicated twice with similar results. 
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Extended Data Fig. 10 | Observation of DAPI-stained phage DNA adjacent to infected with OKZ. The DAPI stain labels DNA. Microscopy was replicated three 
anascent shell that contains Cherry-TopaA. Live-cell fluorescence imaging of times with similar results. 
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Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 
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The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 
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Software and code 


Policy information about availability of computer code 


Data collection Pictures of plaque assays and gels were taken using Image Lab (TM) Version 6.0.1 (C) 2017, Bio-Rad Laboratories, Inc. Growth curve 
assays were collected using BioTek Synergy H1 software Gen5 3.05.11. Microscopy was collected using NIS Elements-AR software 
5.0.2.00. 

Data analysis Pictures of plaque assays and gels were edited using Image Lab (TM) Version 6.0.1 (C) 2017, Bio-Rad Laboratories, Inc. and ImageJ 1.50i. 


Plate reader growth curves were plotted using Prism 6. Plaque assay quantification was plotted using Prism 6. For Cas9 exclusion 
quantification, cells and compartments were chosen using Simple ROI Editor, from NIS Elements-AR 5.0.2.00. Background subtraction 
and average intensities were performed using MatLAB R2018b 9.5.0.944444 and plotted using Prism 6. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 
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All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 
- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


Included in main text. 


= 
fev) 
a 
S 
= 
o 
= 
o 
Nn 
o 
fev) 
= 
a 
= 
= 
o 
O) 
(S) 
a 
= 
a 
Nn 
S 
S| 
= 
fev) 
5 
< 


Field-specific reporting 


Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 
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Life sciences study design 
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Sample size For I-C, I-F, Il-A, V-A, and VI-A CRISPR and type | and type II restriction enzyme plaque assays, n>=2 as output of targeting was either full 
plaque formation or complete protection and thus was not quantified. For II-A and VI-A CRISPR growth curve assays, n=3. For EcoRI Fusion 
plaque assays, n=3. For EcoRI Fusion growth curve assay, n=3. For microscopy, one sample was imaged and representative cells are 
demonstrated. For immunfluorescence quantification, two replicates were imaged and selected cells were chosen (n=30 for Replicate 1, n=20 
for Replicate 2) and analyzed. 
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Data exclusions | Gels in Figure 3 and Extended Figure 5 were cropped for clarity and conciseness. Full gels are provided in a .pdf. 


Replication Plaque assays and growth curves were all performed at least twice, showing replication each time. Here we show representative examples. 
Microscopy data shown is representative examples of phenotypes observed throughout the sample(s). 


Randomization | Samples were organized into experimental groups depending on genotype, including genes and crRNAs encoded by the bacterial strain. 
Randomization was not performed in this study. 


Blinding Blinding was not performed in this study, as it was not relevant given that all results are quantitative, or represent binary states (e.g. exclusion 
vs. inclusion). 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 

n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


| Animals and other organisms 
| Human research participants 


[| Clinical data 


Antibodies 


Antibodies used Santa Cruz Biotechnology, Inc. Cas9 (7A9-3A3): sc-517386, Santa Cruz Biotechnology, Inc cMyc Antibody (9E10): sc-40 


Validation Cas9, from the manufacturer's website: "Yes, this mouse monoclonal antibody, Cas9 (7A9-3A3): sc-517386, has been validated 
to work for immunofluorescent staining applications. Unfortunately, we do not yet have any published IF data for this clone on 
our website. However, this clone, (7A9-3A3) has been widely published in the literature for use in a variety of applications, 
including IF." 
cMyc: There are 5553 citations of this antibody from Santa Cruz in the literature and several examples of both western blots and 
IF on the product website. But they do not specifically address validation of it anywhere. 
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A subset of children with autism spectrum disorder appear to show an improvement 
in their behavioural symptoms during the course of a fever, a sign of systemic 
inflammation’”. Here we elucidate the molecular and neural mechanisms that 
underlie the beneficial effects of inflammation on social behaviour deficits in mice. We 
compared an environmental model of neurodevelopmental disorders in which mice 
were exposed to maternal immune activation (MIA) during embryogenesis** with 
mouse models that are genetically deficient for contactin-associated protein-like 2 
(Cntnap2)’, fragile X mental retardation-1 (Fmr1)° or Sh3 and multiple ankyrin repeat 
domains 3 (Shank3)’. We establish that the social behaviour deficits in offspring 
exposed to MIA can be temporarily rescued by the inflammatory response elicited by 
the administration of lipopolysaccharide (LPS). This behavioural rescue was 
accompanied by a reduction in neuronal activity in the primary somatosensory cortex 
dysgranular zone (S1DZ), the hyperactivity of which was previously implicated in the 
manifestation of behavioural phenotypes associated with offspring exposed to MIA®. 
By contrast, we did not observe an LPS-induced rescue of social deficits in the 
monogenic models. We demonstrate that the differences in responsiveness to the LPS 


treatment between the MIA and the monogenic models emerge from differences in 
the levels of cytokine production. LPS treatment in monogenic mutant mice did not 
induce amounts of interleukin-17a (IL-17a) comparable to those induced in MIA 
offspring; bypassing this difference by directly delivering IL-17a into SIDZ was 
sufficient to promote sociability in monogenic mutant mice as well as in MIA offspring. 
Conversely, abrogating the expression of IL-17 receptor subunit a (IL-17Ra) inthe 
neurons of the SIDZ eliminated the ability of LPS to reverse the sociability phenotypes 
in MIA offspring. Our data support a neuroimmune mechanism that underlies 
neurodevelopmental disorders in which the production of IL-17a during inflammation 
can ameliorate the expression of social behaviour deficits by directly affecting 
neuronal activity in the central nervous system. 


The beneficial effects of infection and the ensuing inflammation on 
neurological disorders have previously been noted’. For example, a 
subset of children with autism spectrum disorder (ASD) exhibit tem- 
porary but considerable improvements of their behavioural symptoms 
during episodes of fever, a sign of systemic inflammation’. How- 
ever, amechanistic understanding of how fever-associated immune 
responses translate into behavioural relief—both at the molecular 
and neural level—is lacking. The aetiology of ASD includes both envi- 
ronmental and genetic risk factors’®. Mouse models for ASD, in which 
mice contain mutations in ASD risk genes (including Cntnap2°, Fmr1°, 
and Shank3’), show behavioural abnormalities—including deficits in 


social interaction”. Similar behavioural abnormalities are observed 
in mouse models for environmental risk factors for developing ASD, 
such as exposure to maternal inflammation”. We sought to explore 
the mechanisms that enable the fever-associated rescue of sociabil- 
ity deficits, using both genetic and environmental mouse models for 
neurodevelopmental disorders. 


LPS rescues sociability in MIA offspring 


A febrile response can be exogenously induced by injecting mice with 
a low dose of LPS”. Indeed, the intraperitoneal administration of 
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Fig. 1| Immune stimulation rescues sociability deficits in MIA offspring. 

a, Body temperature profile after injection of saline (vehicle) or LPSin 
offspring of mothers injected with PBS (PBS offspring) (vehicle, n=11and LPS, 
n=11mice, from 5 independent experiments). The initial spike in body 
temperature is due to handling stress. b, c, Mice were tested for sociability 
(percentage of time spent investigating social object/total time spent 
investigating both social and inanimate objects) one day before LPS injection 
(pre-test; pre). Mice were then tested for sociability four hours after injection 
with vehicle or LPS (test). PBS offspring + vehicle, n=10; PBS offspring +LPS, 
n=9;MIA offspring + vehicle, n=10; MIA offspring + LPS, n=12; wild type 
(WT) + vehicle, n=8; wild type +LPS, n=11; Cntnap2 mutant + vehicle, n=11; 
Cntnap2 mutant +LPS, n=11; Fmri1 mutant + vehicle, n=11;Fmrl mutant +LPS, 
n=15; Shank3 mutant + vehicle, n= 8; Shank3 mutant + LPS, n=10; from 


(test) 


oy 
a 


3 independent experiments. I.p., intraperitoneal. d, Virus encoding the 
inhibitory DREADD (AAV,-hSyn-DIO-hM4D(Gi)-mCherry) was targeted tothe 
ventral part of the lateral preoptic nucleus (vLPO) of Vgat-Cre MIA mice. Scale 
bar,2mm.e, Body temperature profile after injection with vehicle or CNO. 

f, g, Mice were tested for sociability one day before injection (pre). Onthe 
following two days, mice received counterbalanced injections of either vehicle 
or CNO. Sociability was assessed two hours after injection. For experimentsin 
d-g,n=9 forall groups, from 2 independent experiments. All n values refer to 
the number of mice used. *P< 0.05, **P< 0.01, calculated by two-way repeated- 
measures analysis of variance (ANOVA) with Bonferroni's (a, e) or Sidak’s (c) 
post-hoc tests, or one-way repeated-measures ANOVA with Tukey’s post-hoc 
test (g). Graphs are mean+s.e.m. 
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Fig. 2| Immune stimulation reduces hyperactivation in the SIDZ of MIA 
offspring. a, Representative images illustrating FOS (green) expression inthe 
S1DZ and CeA, after injection with vehicle or LPS. Scale bars, 200 pm. Numerals 
indicate cortical layers. b,c, Quantification of FOS-expressing cells inthe 

S1DZ (b) and CeA (c). For experiments in a-c, PBS offspring + vehicle, n=8; 

PBS offspring + LPS, n=9; MIA offspring + vehicle, n=13; MIA offspring +LPS, 
n=11;from3 independent experiments. d,e, AAV encoding either EYFP or EYFP 
fused to enhanced halorhodopsin (NpHR) was bilaterally injected into the SIDZ 
of monogenic mutant mice. Scale bar, 500 um. f, Performance on sociability 
was assessed in the presence (on) and absence (off) of optical inhibition. For 
experiments in d-f, wild type + EYFP, n=7; wild type + NDHR, n=8; Cntnap2 
mutant + EYFP, n=11; Cntnap2 mutant + NpHR, n=9; FmrI mutant + EYFP, n=8; 
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Fmr1 mutant + NpHR, n=12; Shank3 mutant + EYFP, n= 8; Shank3 

mutant + NpHR, n=10; from 6 independent experiments. g, Representative 
images illustrating FOS expression in the SIDZ and CeA after injection 

of vehicle or LPS in monogenic mutant mice. Scale bars, 200 um. 

h, i, Quantification of FOS-expressing cells after injection of vehicle or LPS in 
the S1DZ (h) and CeA (i). For experiments in g-i, Cntnap2 mutant + vehicle, n=9; 
Cntnap2 mutant + LPS, n=10; Fmr1 mutant + vehicle, n=7; Fmrl mutant +LPS, 
n=9; Shank3 mutant + vehicle, n= 6; Shank3 mutant +LPS,n=8; from 

3 independent experiments. Alln values refer to the number of mice used. 
Statistics calculated by two-way ANOVA with Tukey’s post-hoc test (b,c) or 
Sidak’s post-hoc test (h, i), or two-way repeated-measures ANOVA with Sidak’s 
post-hoc test (f). Graphs are mean+s.e.m. 
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Fig. 3|IL-17a rescues sociability deficits in both MIA offspring and 
monogenic mutant mice. a, IFNy, IL-6, IL-17a and TNF levels in plasma after 
injection of vehicle or LPS. PBS offspring + vehicle, n= 6; PBS offspring +LPS, 
n=8;MIA offspring + vehicle, n= 6; MIA offspring + LPS, n=8; wild 

type + vehicle, n=6; wild type +LPS, n=7; Cntnap2 mutant + vehicle, n=4; 
Cntnap2 mutant +LPS,n=5;Fmrl mutant + vehicle, n=5;Fmr1 mutant+LPs, 
n= 6; Shank3 mutant + vehicle, n=7; Shank3 mutant + LPS, n=7; from 

3 independent experiments. b, /[17ra expression in wild-type and IL-17Ra- 
knockout (KO) mice at anterior—-posterior (AP) —-0.58 mm, from lindependent 
experiment. Scale bar, 1mm.c, Colabelling of //17ra and NeuN inthe S1DZ, from 
2 independent experiments. Scale bar, 100 pm. d, Quantification of /l17ra 
expression within the SIDZ according to cortical layer in PBS offspring. n=8, 
from 2 independent experiments. e, f, Mice were tested for sociability one day 
before injection (pre) and after bilateral administration of vehicle or IL-17a into 
the S1DZ (test). PBS offspring + vehicle, n= 11; PBS offspring + IL-17a,n=12; MIA 
offspring + vehicle, n=14; MIA offspring +IL-17a, n =10; wild type + vehicle, 
n=11; wildtype +IL-17a, n=11; Cntnap2 mutant + vehicle, n=8; Cntnap2 
mutant +IL-17a,n=10; Fmri mutant + vehicle, n=9; Fmr1 mutant + IL-17a, n=11; 
from 6 independent experiments. Alln values refer to the number of mice used. 
Statistics calculated by two-way ANOVA with Dunnett’s post-hoc test (a) or 
two-way repeated-measures ANOVA with Sidak’s post-hoc test (f). Graphs are 
meant+s.e.m. 


LPS (50 pg kg?) to control mice—offspring born to mothers injected 
with PBS—led to a significant increase in body temperature (of about 
0.5-1.0 °C) about 4-5 h after the treatment (Fig. 1a). To investigate 
whether LPS injection can ameliorate sociability deficits, we compared 
sociability in adult male mice one day before LPS injection (pre-test) 
and four hours after injection (test), at which point the increase in 
body temperature initiates (Fig. 1b). As previously reported**>°, MIA 
offspring born to mothers injected with polyinosinic:polycytidylic acid 
(poly(I:C)) at embryonic day (E)12.5 exhibited impaired social approach 


behaviour during the pre-test (Fig. 1c, Extended Data Fig. 2). The Cntnap2, 
Fmriand Shank3 monogenic models also displayed sociability deficits, 
albeit with marked variability during the pre-test (Extended Data Fig. 1). 
We therefore compared MIA offspring to monogenic mutant mice that 
exhibited a sociability index lower than about 62%, the mean value for 
the three monogenic lines (Extended Data Fig. 1a). 

LPS injections robustly rescued the characteristic deficits in socia- 
bility of MIA offspring; treated MIA offspring displayed an interest in 
social objects that was indistinguishable from that of controls (Fig. 1c, 
Extended Data Fig. 2). LPS injection did not affect baseline sociability in 
control offspring. The LPS-induced rescue of sociability was absent 72h 
after the treatment, paralleling the transient nature of fever-associated 
improvement that has previously been observed in children with ASD! 
(Extended Data Fig. 3a—e). Furthermore, the LPS-induced rescue was 
observed in both young (2-5 months old) and aged (9-12 months old) 
mice (Fig. Ic, Extended Data Fig. 3f-j), which suggests that this is a 
generalizable phenomenon that is not confined to early adulthood. 
The LPS-induced rescue of the sociability deficits was also evident 
when assayed using a reciprocal social interaction test (Extended Data 
Fig. 3k). Finally, the LPS treatment of MIA offspring also suppressed the 
enhanced marble burying behaviour (Extended Data Fig. 31), which 
shows that inflammation can relieve several of the behavioural phe- 
notypes associated with MIA. Unlike in MIA offspring however, LPS 
treatment did not restore sociability in mutant mice that are deficient 
for Cntnap2, Fmr1 or Shank3 (Fig. 1c, Extended Data Fig. 2), indicating 
that rescue by LPS-driven inflammation may be applicable to only a 
subset of mouse models for neurodevelopmental disorders. 


Acute fever does not restore sociability 


We next probed whether fever has a role in the observed behavioural 
rescue. Unlike in control offspring, LPS treatment did not induce 
changes in body temperature in MIA mice (Extended Data Fig. 4a), 
which suggests that the febrile response might not be the main factor 
that contributes to the rescue. To directly test whether fever is dispen- 
sable, we sought to increase the body temperature of the mice without 
inducing systemic inflammation by targeting inhibitory designer recep- 
tors exclusively activated by designer drugs (DREADDs)” to GABAergic 
neurons in the ventral part of the lateral preoptic nucleus of Vgat-Cre 
(Methods) mice’s (Fig. 1d). As previously reported”, inhibition of these 
neurons led to an increase in body temperature of about 1 °C (Fig. le). 
However, the induction of febrile response alone did not promote 
social preference in MIA offspring (Fig. 1f, g, Extended Data Fig. 4b-e), 
confirming that fever per se is not the main driver of the LPS-induced 
rescue. Of note, Vgat-Cre MIA offspring exhibited sociability deficits 
comparable to those of wild-type MIA offspring, were unaffected by 
treatment with clozapine N-oxide (CNO) and showed an increase in 
sociability after treatment with LPS (Extended Data Fig. 4f-j). 


LPS reduces FOS induction in the S1DZ 


We previously established that adult MIA offspring display cortical 
abnormalities that are preferentially localized in the S1IDZ°, asubregion 
of the primary somatosensory cortex (S1) that is cyto-architecturally 
defined by the absence of a discernible fourth layer (Extended Data 
Fig. 5). The cortical phenotype is characterized by an overall increase 
inneural activity that, when reduced, can acutely rescue MIA-induced 
deficits in social behaviours®. We therefore investigated whether the 
LPS-induced behavioural rescue in MIA offspring is accompanied 
by changes in neural activity in the SIDZ. MIA offspring exhibited an 
increase inthe number of cells in the SIDZ that express FOS (a marker 
for neuronal activation), relative to control offspring. However, in 
LPS-treated MIA offspring, the number of FOS* neurons in the S1DZ 
was reduced to the level of control offspring (Fig. 2a, b, Extended Data 
Fig. 6a—c). LPS injections did not elicit a generalized, brain-wide effect 
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Fig. 4 | IL-17a is necessary for the LPS-induced rescue of sociability deficits 
and reduction of neural activity in the SIDZ in MIA offspring. a, Mice were 
tested for sociability one day before injection (pre). The following day, blocking 
antibody against IL-17a (anti-IL-17a) or isotype-control antibody (isotype) was 
administered intracerebroventricularly (i.c.v.), 30 min before injection of 
vehicle or LPS. Sociability was assessed four hours after injection of vehicle or 
LPS (test). PBS offspring + vehicle + isotype, n=9; PBS offspring + LPS + isotype, 
n=11;MIA offspring + vehicle + isotype, n=10; MIA offspring + LPS + isotype, 
n=10;MIA offspring + LPS + anti-IL-17a, n=10; from 7 independent 
experiments. b-d, Firing rate of neurons of the SIDZ before and four hours 
after injection of vehicle or LPS, in PBS and MIA offspring pre-treated with 
isotype-control antibody or blocking antibody against IL-17a.c, Example raster 
plot with firing rate profile before (baseline) and after (test) treatment with 
LPS, froman PBS mouse and MIA mouse pre-treated with isotype-control 
antibody. d, Normalized firing-rate (FR) change after treatment represented as 
box-and-whisker plots, indicating median, interquartile range and data limits 


onFOS expression in MIA offspring; the number of FOS* neurons either 
remained unchanged (as in several of the cortical regions we examined) 
or increased, as in the central amygdala (CeA)—a region that is known 
to be activated by LPS” (Fig. 2a, c, Extended Data Fig. 6d, e). Therefore, 
the LPS-induced behavioural rescue in MIA offspring was accompanied 
by areduction in neural activity in the SIDZ. 

Dysregulation of neural activity and deficits in interneuron function 
in the S1 have previously been associated with various genetic mouse 
models of neurodevelopmental disorders” **. We therefore sought 
to determine whether increased neural activity can also be observed 
inthe SIDZ of monogenic mutant mice. The number of FOS* neurons 
in the SIDZ of these mutant mice was increased compared to that of 
wild-type mice, and the magnitude of this increase correlated with the 
severity of the sociability deficits—notably in Cntnap2 and Fmr1 mutant 
mice (Extended Data Fig. 7a, b). These data suggest that increased 
neural activity in the SIDZ may also contribute to the expression of 
sociability deficits in monogenic mutant mice, similarly to what has 
been described for MIA offspring®. Consistent with this idea, optoge- 
netically reducing neural activity in the SIDZ could rescue sociability 
deficits in Cntnap2 and Fmr1 mutant mice (Fig. 2d-f, Extended Data 
Fig. 7c-f). Shank3 mutant mice also showed an increase in sociability 
upon photoinhibition, but this increase was not significantly different 
from that of Shank3 control mice that express enhanced yellow fluores- 
cent protein (EYFP). Therefore, reducing neural activity in the SIDZ was 
sufficient to restore sociability in Cntnap2 and Fmr1 mutant mice, as well 
as in MIA offspring®. Unlike in MIA offspring however, LPS treatment 
did not reduce the number of FOS* neurons in the SIDZ of monogenic 
mutant mice (Fig. 2g-i). 
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as defined by Tukey. PBS offspring + vehicle + isotype, n=65 cells; 

PBS offspring + LPS + isotype, n=42 cells; PBS offspring + LPS + 
anti-IL-17a,n=40 cells; MIA offspring + vehicle + isotype, n=75 cells; 

MIA offspring +LPS + isotype, n=48 cells; MIA offspring + LPS + anti-IL-17a, 
n=43 cells; from 2 PBS offspring and 2 MIA offspring in 12 independent 
experiments. e, Lentivirus encoding either EYFP or enhanced green 
fluorescent protein (EGFP) fused to nuclear Cre (nCre) was bilaterally injected 
into the S1DZ of IL-17Ra" MIA offspring. Scale bar, 200 um. f, g, Mice were 
tested for sociability one day before injection (pre). The following day, mice 
were tested for sociability four hours after LPS injection (test). For experiments 
ine, f, I-17ra™“",EYFP,n=9 and IL-17ra“";EGFP:nCre, n=10; from5 independent 
experiments. Unless otherwise indicated, n values refer to the number of mice 
used. Statistics calculated by two-way repeated-measures ANOVA with 
Bonferroni's post-hoc test (a, f) or two-way ANOVA with Tukey’s post-hoc 

test (d). Graphs are mean+s.e.m. 


IL-17a rescues sociability deficits 

The injection of LPS is known to increase the production of inflamma- 
tory cytokines‘. We observed that administration of LPS results ina 
robust increase in the plasma levels of IFNy, IL-6 and TNF (also known 
as TNF-a) (Fig. 3a). IL-17a—the orthologue of which in Caenorhabditis 
elegans has previously been implicated in modulating sensory 
responses*—was prominently upregulated in MIA offspring, but not 
in monogenic mutant mice or in control mice (Fig. 3a). Furthermore, 
we noted that the receptor subunit A for IL-17a (IL-17Ra) is expressed 
in cortical neurons, including in the S1DZ (Fig. 3b-d, Extended Data 
Fig. 8a—c). These data suggested that the increased levels of IL-17a after 
treatment with LPS in MIA offspring, but notin monogenic mutant mice, 
may directly affect the SIDZ and thus restore sociability. Consistent with 
this idea, the direct administration of recombinant IL-17a into the SIDZ 
was sufficient to increase sociability not only in MIA offspring but also 
in Cntnap2and Fmr1 mutant mice (Fig. 3e, f, Extended Data Fig. 8d-h). 


LPS-induced rescue requires IL-17a 

To further determine whether IL-17a mediates the LPS-driven behav- 
ioural rescue, we inhibited IL-17a activity in the brain via intracere- 
broventricular injection of blocking antibodies. Antibodies against 
IL-17a prevented both the LPS-induced rescue of sociability (Fig. 4a, 
Extended Data Fig. 9a—e) and the reduction of FOS expression in 
the SIDZ of MIA offspring (Extended Data Fig. 9f). To directly assay 
the effects of LPS on neural activity, we used multi-electrode arrays 
to measure the firing rate of neurons of the S1DZ in awake mice 


(Extended Data Fig. 10a, b). Upon treatment with LPS, we observed a 
decrease in the overall firing rate that was prevented by blocking IL-17a 
in MIA offspring. Treatment with LPS or IL-17a blocking antibody did 
not change the neuronal firing rate in the SIDZ of control offspring 
(Fig. 4b-d, Extended Data Fig. 10c). Furthermore, LPS injection did 
not restore sociability to MIA offspring that are deficient for IL-17Rain 
the neurons of the S1DZ (Fig. 4e, f, Extended Data Fig. 10d-j). Our data 
collectively demonstrate that IL-17a mediates the restoring effects 
of inflammation on social behaviours by directly acting on IL-17Ra* 
neurons of the SIDZ. 


Discussion 


Previous data in mice have suggested that increased production of 
IL-17a in pregnant mothers may present as a risk factor for neurode- 
velopmental disorders in offspring®°””. On the basis of our current 
findings, we propose that in adult MIA offspring the same cytokine is 
beneficial and ameliorates sociability phenotypes during episodes of 
inflammation. Treatment with LPS led to an increase in IL-17a levels in 
the blood selectively in MIA offspring—but not in other monogenic 
mutant mice. This suggests that the inflammatory responses may 
result in beneficial effects only for individuals who have their immune 
systems primed by prenatal exposure to immune activation, or by 
other environmental factors. A better understanding of the role of 
a primed immune system among patients with neurodevelopmen- 
tal disorders may help to identify those patients whose behavioural 
symptoms are likely to improve after exposure to fever-associated 
inflammation. Furthermore, elucidating the mechanisms by which 
IL-17a can induce two opposing behavioural outcomes depending on 
when its upregulation occurs (during embryonic brain development 
or in the adult brain) may provide opportunities to devise therapeu- 
tic as well as preventive treatments for the behavioural symptoms 
associated with ASD. 
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Methods 


Mice 

All experiments were performed according to the Guide for the Care 
and Use of Laboratory Animals and were approved by the National Insti- 
tutes of Health and the Committee on Animal Care at Massachusetts 
Institute of Technology. C57BL/6 mice were purchased from Taconic 
Biosciences for generating PBS and MIA offspring. Slc32a1(Vgat)- 
Cre (028862) mice were purchased from Jackson Laboratories, and 
inbred. For monogenic experiments, C57BL/6, Cntnap2 (017482), Fmr1 
(003025) and Shank3 (017688) mice were purchased from Jackson 
Laboratories, and inbred after colonizing with segmented filamen- 
tous bacteria (SFB) from donor mice. /I-17ra“ and IL-17Ra-knockout 
mice have previously been described”*”’. All mice were males aged 
2-5 months, unless otherwise specified. 


Maternal immune activation 

Mice were mated overnight with females carrying SFB in their guts”°. 
OnE12.5, pregnant female mice were weighed and injected witha single 
dose (20 mg/kg, intraperitoneal injection) of poly(I:C) (P9582, Sigma 
Aldrich) or PBS. Each dam was returned to its cage and left undisturbed 
until the birth ofits litter. All pups remained with the mother until wean- 
ing on postnatal day (P)21-28, at which time mice were group-housed at 
amaximum of 5 per cage with same-sex littermates. Matings between 
Vgat-Cre (c/c) males and wild-type females were used to make MIA 
Vgat-Cre mice. 


Stereotaxic surgery 

Surgeries were carried out using aseptic techniques. Mice were anaes- 
thetized using a mixture of ketamine (100 mg/kg, intraperitoneal 
injection) and xylazine (10 mg/kg, intraperitoneal injection). Mice 
were given pre-operative slow-release buprenorphine (1.0 mg/kg, 
subcutaneous injection). For manipulating body temperature with 
vLPO inhibition, Vgat-Cre mice received bilateral stereotaxic injec- 
tions of virus (200 nl per side) at rates of <O.1 ml/min (AAV,-hSyn- 
DIO-hM4Di-mCherry (Addgene)). Virus was bilaterally targeted to 
the vLPO (AP +0.15, medial-lateral (ML) +0.50, dorsal-ventral (DV), 
-4.9, lambda was raised 600 um above bregma). For experiments 
involving optical manipulation in the S1DZ, wild-type, Cntnap2, Fmr1 
and Shank3 mice received bilateral stereotaxic injections of virus 
(400 pl/side) at a rate of <O.1 ml/min (AAV,-hSyn-EYFP, AAV,-hSyn- 
eNpHR3.0:EYFP). Viruses were bilaterally targeted to the SIDZ (AP 
-0.50, ML+2.6, DV -0.80, lambda was level with bregma). To deliver 
light into the SIDZ, a300-um optic fibre was superficially implanted 
in the S1DZ. For experiments involving genetic knockout of IL-17Ra 
in the SIDZ, /l-17ra“ mice received bilateral stereotaxic injections 
of virus (800 nl per side) at a rate of <O.1 ml/min (pLenti-hSyn-EYFP, 
pLenti-hSyn-EGFP:nCre). Viruses were bilaterally targeted to the S1DZ. 
Alllentiviruses were made in-house. For administration of cytokines 
into the SIDZ, cannulas (PlasticsOne) were implanted superficially 
within the S1DZ. For central administration of blocking antibodies, 
cannulas were implanted above the right lateral ventricle (AP -0.30, 
ML +1.0, DV -1.35, lambda was raised 600 pm above bregma). Cannulas 
were fitted with dummy cannulas (PlasticsOne) to maintain cannula 
patency after surgery. 


Tracking of body temperature 

Body temperature was measured using the Anipill remote temperature 
monitoring system (007894-001, DSI). Adult male mice were implanted 
with an Anipill capsule inthe abdominal cavity. Experiments were car- 
ried out more than three weeks after surgery. Mice were singly housed 
the day before the experiment. Ambient temperature was maintained 
at 23.5 °C, consistent with the ambient temperature of the vivarium. 
Body temperature was sampled in 5-min increments. LPS or vehicle 


injections occurred between 11:00-13:00 for experiments assaying the 
effects of LPS on body temperature in PBS and MIA offspring. 


Immunohistochemistry 

Mice were transcardially perfused with cold paraformaldehyde (PFA) 
(4% in PBS). Brains were keptin PFA overnight at 4 °C before vibratome- 
sectioning (Leica VT1000s). Brains were cut at 50-um thickness for 
FOS quantification. Brains were cut at 100-um thickness for all other 
experiments. 

Before antibody labelling, sections were incubated in blocking solu- 
tion (0.4% Triton X-100 and 2% goat serum in PBS) for 30 min. Sections 
were then incubated in blocking solution containing primary anti- 
bodies overnight at room temperature. Primary antibodies used were 
chicken anti-GFP (1:1,000, Ab5450, Abcam), rabbit anti- FOS (1:500, 
ABE457, Millipore), rabbit anti-DsRed (1:1,000, 632496, Clontech) and 
mouse anti-NeuN (1:1,000, MAB377, Millipore). Sections were washed 
in wash buffer (0.4% Triton X-100 in PBS) three times before second- 
ary antibody labelling. Sections were incubated in blocking solution 
containing secondary antibodies and DAPI (1:5,000, D1306, Thermo 
Fisher) for three hours at room temperature. Images of stained slices 
were acquired using a confocal microscope (LSM710, Carl Zeiss) with 
a10x, 20x or 40x objective lens. 


Insitu hybridization 

Mice were transcardially perfused with cold PBS. Brains were extracted 
and embedded in optimal cutting temperature (OCT) compound on 
dry ice. Sections were cut at 20-um thickness on a cryostat. In situ 
hybridizations were performed using the RNAscope 2.5 HD Assay-Red 
kit (22350, Advanced Cell Diagnostics) using a probe targeting the 
/l17ratranscript (Mm-Il17ra-O1, 566131, Advanced Cell Diagnostics). The 
probe was designed to target region 444-882 of the /l17ra transcript 
(NM_008359.2). Modifications to the kit protocol to improve adherence 
of tissue to the slide included an extension of the fixation time to 30 min 
and the addition of ahumidified bake step at 40 °C immediately before 
probe hybridization. Sections were counterstained with DAPI. Images 
were acquired using a confocal microscope (LSM710, Carl Zeiss) with 
a10x or 20x objective lens. //17ra and DAPI expression was quantified 
using QuPath*’. Cells were divided into the following categories based 
on level of //17ra expression: low = 1-3 puncta, medium = 4-9 puncta, 
high = 10-15 puncta, highest = >15 puncta. 


In situ hybridization followed by immunohistochemistry 

For experiments assaying the overlap of //17ra and NeuN expression, 
immunohistochemistry for NeuN was performed following a modi- 
fied in situ hybridization protocol. The RNAscope 2.5 HD Assay-Red 
kit in situ protocol was modified in the following ways: sections were 
baked at 60 °C, followed by a10-min fixation step with 4% PFA at room 
temperature. Sections were then stored in 70% ethanol at 4 °C over- 
night. Sections were permeabilized in 8% SDS for 10 min. Sections 
were washed twice with PBS between each step. After SDS treatment, 
the RNAscope 2.5 HD Assay-Red kit protocol was followed from the 
probe hybridization step. Following the completion of the in situ pro- 
tocol, sections were incubated in blocking buffer containing anti-NeuN 
antibody overnight at 4 °C. Sections were then incubated in blocking 
buffer containing DAPI and secondary antibody for two hours at room 
temperature. 


Behavioural analysis 

Male mice were tested during the light cycle in a room with lighting 
maintained at 230 lux. Mice were transferred to the testing area at 
least one hour before the initiation of experiments. Tracking of mouse 
behaviour was done using the EthoVision XT (Noldus) tracking system. 


Three-chamber social approach assay. Adult male mice were assayed 
for sociability using athree-chamber social approach assay. The arena 


was constructed of white acrylic (50 cm x 35 cm x 30 cm). Wire cups 
(Spectrum Diversified) were placed in the back left and right corner of 
the arena beneath water-filled 1-1 bottles (Nalgene). On day O, mice were 
habituated to the arena for 10 min. Immediately after habituation, mice 
were singly housed. On day 1 (pre-test), mice were placed in the centre 
of the arena and allowed to freely explore. Following 10 min, mice were 
confined to the centre of the arena. An inanimate object (rubber stop- 
per) oramale conspecific were placed beneath the wire cups. Placement 
of the inanimate object and social target were alternated. Mice were 
then allowed to freely explore the arena for 10 min. Interaction time 
was defined as time spent in the areas circumscribing the wire cups 
(<2. cm). Sociability was defined as interaction time with the social target 
divided by total interaction time and expressed as a percentage. For 
experiments involving LPS injections, mice were injected with either 
saline (vehicle) or LPS (50 pg/kg, intraperitoneal injection, L2630, 
Sigma) on day 2 (test), four hours before testing. For the experiment 
assaying sociability 72 h after injection of LPS, mice used for 4-h LPS 
sociability experiments were tested for sociability again at 72 h. 


Three-chamber social approach assay with DREADD manipulation. 
Adult male Vgat-Cre MIA offspring were bilaterally injected with virus 
encoding the inhibitory DREADD receptor fused to mCherry into the 
vLPO. After more than three weeks of recovery, mice were assayed on 
the three-chamber social approach assay outlined above. Baseline 
sociability was assayed on day 1 (pre-test). On day 2 and day 3, mice 
were injected with either vehicle or CNO (1.5 mg/kg, intraperitoneal 
injection, BML-NS105, Enzo Life Sciences), two hours before initiation of 
behaviour. Injection order was counterbalanced. Following behavioural 
experiments, post-mortem histology was used to confirm mCherry 
expression within the vLPO. For experiments assaying the effect of 
injection of CNO and LPS in Vgat-Cre PBS and MIA offspring that have 
not undergone surgery, baseline sociability was assessed on day 1. 
On day 2 and 3, mice received counterbalanced injections of CNO or 
vehicle. On day 4, mice were injected with LPS. 


Three-chamber social approach assay with administration of IL-17a 
into the S1DZ. Adult male mice were implanted with a cannula into 
S1DZ bilaterally and allowed to recover for more than two weeks before 
the behavioural experiments. On day 1 (pre-test), mice were assayed 
for sociability. On day 2 (test), mice were anaesthetized briefly using 
isoflurane and either vehicle or IL-17a (50 ng per side in1 pl at arate of 
180 nl/min, 7956-ML/CF, R&D) was administered bilaterally into the 
S1DZ through 250-um projecting injector tips (PlasticsOne). Four hours 
after administration of vehicle or IL-17a, mice were assayed for sociabil- 
ity. Cannula placements were verified using histology. 


Three-chamber social approach assay with S1DZ optogenetic 
inhibition. EYFP or NpHR were virally targeted to the S1DZ. After 
two weeks of recovery, mice were allowed to freely explore the arena 
for 10 min. The following day, the mice were given 3 min of no stim- 
ulation (‘off’ session) and 3 min of laser stimulation (‘on’ session) 
(594 nm, 6mW). 


Three-chamber social approach assay with IL-17a blocking 
antibody. For central cytokine blockade experiments, adult male mice 
were implanted with a cannula into the lateral ventricle and allowed to 
recover for more than two weeks before the behavioural experiments. 
On day 1 (pre-test), mice were tested for baseline sociability. On day 2 
(test), mice were injected with IL-17a blocking antibody (clone 50104; 
R&D) orisotype-control antibody (IgG2a, clone 54447; R&D). Antibodies 
were dissolved in saline and administered intracerebroventricularly 
at 1mg/kg in 500 nl ata rate of 180 nl/min through 750-um projecting 
injector tips (PlasticsOne). Blocking antibody was administered 30 min 
before administration of LPS. Four hours following administration of 
LPS, mice were assayed for sociability. 


Three-chamber social approach assay in mice deficient for IL-17Ra 
in the S1DZ. IL-17Ra-knockout was mediated by viral delivery of Cre re- 
combinase, expressed under the control of the human synapsin (hSyn) 
promoter, into the S1DZ of Il-17ra“ MIA offspring. After at least three 
weeks of recovery, mice were assayed for sociability. On the next day 
sociability was assessed four hours following administration of LPS. 


Marble burying assay. On day 1, mice were tested for their baseline 
marble burying phenotype. On day 2, four hours before beginning the 
marble burying assay, mice were treated with either LPS or vehicle. The 
marble burying assay was carried out as previously described®. Mice 
were placed into testing arenas (arena size 40 cm x 20 cm x 30 cm; bed- 
ding depth 3 cm) each containing 20 glass marbles (laid out in 4 rows 
of 5 marbles equidistant from one another). At the end of the 15-min 
exploration period mice were carefully removed from the testing cages 
and the number of marbles buried was recorded. The marble burying 
index was arbitrarily defined as the following: 1 for marbles covered 
>50% with bedding, 0.5 for marbles covered <50% with bedding or 0 
for anything less. 


Reciprocal social interaction assay. Four hours before testing, mice 
were injected with vehicle or LPS. Two unfamiliar mice of the same treat- 
ment and background were placed ina fresh mouse cage and allowed to 
freely interact for 10 min. Videos were acquired using IC Capture (The 
Imaging Source) at a 640 x 480 aspect ratio and 25 frames per second. 
Social interaction (close following, push—crawl, nose-nose sniffing 
and nose-anus sniffing) was scored by an observer blind to treatment 
and background”. 


Quantification of FOS* cells in the brain after administration of 
LPS 

Adult male mice were killed five hours after injection of LPS. FOS* cells 
were quantified using the Cell Counter plugin in Fiji”. All cells were 
counted within a single coronal section of each respective brain region, 
as defined by the Paxinos and Franklin Mouse Atlas®. Regions quanti- 
fied include: medial prefrontal cortex (mPFC) (comprising prelimbic 
cortex (PrL) and and infralimbic cortext (IL)), AP +1.98; SIDZ, AP -0.46; 
primary somatosensory cortex barrel field (SIBF), AP —0.46; primary 
motor cortex (M1), AP—0.46; secondary motor cortex (M2), AP—-0.46; 
secondary auditory cortex dorsal part (AuD), AP -1.94; CeA, AP -1.94; 
and primary visual cortex (V1), AP —3.64. For experiments testing the 
IL-17a dependence of LPS-induced changes in FOS expression, mice 
were injected intracerebroventricularly with control antibodies or 
blocking antibody against IL-17a 30 min before intraperitoneal injec- 
tion of vehicle or LPS. Surgical and injection methods were identical 
to behavioural experiments. 


Enzyme-linked immunosorbent assay 

After four hours of administration of vehicle or LPS, mice were anaes- 
thetized by intraperitoneal injection of Fatal-Plus (100 mg/kg). All 
blood samples were centrifuged at 10,000g for 10 min at 4 °C. Allsam- 
ples were stored at —-80 °C until further analysis. Cytokine concentra- 
tions in plasma were measured using an ELISA kit (IFNy; 430804, TNF; 
430904, IL-6; 431304 and IL-17a; 432504, Biolegend), following the 
manufacturer’s instructions. 


PCR for assaying IL-17Ra knockout 

1017ra male mice were bilaterally injected with virus encoding nuclear 
Cre fused to EGFP, or control virus encoding only EYFP, into the SIDZ. 
After more than three weeks, injection sites were dissected from the 
S1. Single cells were dissociated from brain tissue using a modified 
version™ of the Papain Dissociation Kit protocol (LKO03153, 
Worthington) and sorted ona BD FACS Aria (BD Biosciences) based on 
EGFP and EYFP expression. RNA was extracted from sorted cells using 
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a Quick-RNA micro-prep kit (Zymo). Twenty nanograms of RNA was 
converted into cDNA using oligodT (Protoscript First Strand CDNA 
Synthesis Kit, NEB). //17ra mRNA expression was analysed using PCR. 
One microlitre of cDNA was diluted in a 20-pl reaction volume. /l17ra 
and Gapdh mRNA expression was assessed using the following prim- 
ers: /l17ra 5’-AGATGCCAGCATCCTGTACC-3’ and 5’-CACAGTCACAG 
CGTGTCTCA-3’; Gapdh 5’-GACTTCAACAGCCTCCCACTCTTCC-3’ 
and 5’-TGGGTGGTCCAGGTTTCTTACTCCTT-3’. Cycling conditions 
for /[l17ra: 95 °C x 5 min (1 cycle), 95 °C x 20s, 60 °C x 30s, 72 °C x 30s 
(32 cycles), 72 °C x 5 min (1 cycle) and 4 °C hold. Cycling conditions 
for Gapdh: 95 °C x 5 min (1cycle), 95 °C x 20s, 60 °C x 30s, 72 °C x 30s 
(28 cycles), 72 °C x 5 min (1 cycle) and 4 °C hold. Band intensity from 
gel images was quantified using Image). 


In vivo electrophysiology 

Electrophysiological experiments were conducted in head-fixed mice 
trained to walk ona rotating running wheel. Before training, mice were 
implanted with custom crowns permitting head-fixing above the wheel 
and allowed to recover for one week before training. Mice were trained 
to walk on the wheel for three 10-min sessions daily for at least one week. 
Following training, mice were implanted with a multi-electrode array 
targeting the S1DZ and allowed to recover before testing. To assess 
the IL-17a dependence of LPS-induced changes in neural activity, mice 
were injected intraperitoneally with control antibodies or blocking 
antibody against IL-17a (1 mg per mouse), 30 min before intraperitoneal 
injection of vehicle or LPS. Baseline neural activity was measured while 
mice were running on the wheel immediately before the first injection. 
After the first injection, mice were returned to their home cage. 
Post-injection (test) neural activity during wheel running was measured 
4 hafter injection of vehicle or LPS. 


Multi-electrode array construction and implantation. Custom multi- 
electrode array scaffolds (drive bodies) were designed using 3D CAD 
software (SolidWorks) and printed in Accura 55 plastic (American 
Precision Prototyping), as previously described**”*. Beforeimplantation, 
eacharray scaffold was loaded with 8-24 independently movable micro- 
drives carrying 12.5-m nichrome (California Fine Wire Company) 
tetrodes. Electrodes were pinned to custom-designed 32- or 128-channel 
electrode interface boards (Sunstone Circuits) along with acommon 
reference wire (A-M systems). 


Electrophysiological recordings and spike sorting. Signals were 
acquired using a Neuralynx multiplexing digital recording system 
(Neuralynx) through a combination of 32- and 64-channel digital 
multiplexing headstages plugged into the electrode interface board 
of the implant. Signals from each electrode were amplified, filtered 
between 0.1 Hz and 9 kHz and digitized at 30 kHz. Initial spike sorting 
was performed using MountainSort, followed by manual quality 
control using MClust toolbox (http://redishlab.neuroscience.umn. 
edu/mclust/MClust.html). 


Analysis of firing rate. At time 0 in each recording session, mice began 
walking on the wheel rotating at 7.5 cms. Data collected between 
land 2s were included in the analysis. The firing rate was calculated 
across 1-s time windows during which mice were walking stably onthe 
rotating wheel, averaged across 25-50 trials per condition. The first 
second after the onset of wheel rotation was omitted, to avoid firing 
rate changes due to acceleration. Firing rate was sampled with a 1-ms 
bin width passed through a box car filter (100 ms). The resulting peri- 
stimulus time histograms were then smoothed witha 50-ms Gaussian. 


To assess the effect of treatment on neural activity, changes in firing 
rates during wheel running 4 h after injection were normalized to the 
pre-injection baseline. Cells showing a firing rate below 0.1 Hz were 
excluded from analysis. 


Statistics and reproducibility 

Statistical analyses were performed using GraphPad Prism. Sample 
size was chosen on the basis of similar previous studies, and not on 
statistical methods to predetermine sample size®». Within each itera- 
tion of an experiment, mice were randomly assigned to groups with 
approximately balanced sample size. Behavioural results from mice 
with inaccurate targeting of viral infection or cannula implantations 
were excluded. Experimenters were blind to subject treatment during 
data collection and analysis. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
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Source Data for Figs. 1-4 and Extended Data Figs. 1-4, 6-10, containing 
raw data for all experiments, are provided with the paper. All other data 
are available from the corresponding author on reasonable request. 
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Extended Data Fig. 1| Cntnap2, Fmr1 and Shank3 mutant mice show variable time (c) and distance travelled (d) during the three-chambered sociability 


sociability performance.a, Sociability performance. MIA, n=13; wild-type, experiments described ina. Alln values refer to the number of mice used. 
n=22; Cntnap2 mutant, n=71;Fmrl1 mutant, n=165; Shank3 mutant, n=50; Statistics calculated by one-way ANOVA with Dunnett’s post-hoc test (a, c, d) or 
from 30 independent experiments. b, Time spent investigating social (S) two-way ANOVA with Dunnett’s post-hoc test (b). Graphs are mean +s.e.m. 


versus inanimate (I) objects for mice described ina.c,d, Totalinteraction 
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Extended Data Fig. 2 | Further behavioural analyses for sociability 
performance after treatment with LPS in PBS and MIA offspring, and 
monogenic mutant mice. a—d, Time spent investigating social (S) versus 
inanimate (I) objects (a), total interaction time (b), time spent in social (S), 
centre (C) or inanimate (I) chamber (c), and distance travelled (d) for sociability 
experiments in Fig. 1c. PBS offspring + vehicle, n=10; PBS offspring +LPS,n=9; 
MIA offspring + vehicle, n=10; MIA offspring + LPS, n=12; wild type (WT) + 


vehicle, n= 8; wild type+ LPS, n=11; Cntnap2 mutant + vehicle, n=11; Cntnap2 
mutant+LPS, n=11; Fmri mutant + vehicle, n=11; Fmr1 mutant +LPS, n=15; 
Shank3 mutant + vehicle, n= 8; Shank3 mutant + LPS, n=10; from 3 independent 
experiments. All values refer to the number of mice used. Statistics calculated 
by two-way ANOVA with Sidak’s (a) or Dunnett’s (c) post-hoc test, or two-way 
repeated-measures ANOVA with Sidak’s post-hoc test (b, d). Graphs are 

mean +s.e.m. 
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Extended Data Fig. 3| LPS-induced rescue of MIA behavioural phenotypes is 
transient, effective in aged mice and extends beyond three-chambered 
sociability. a—e, Sociability measured 72 h after injection with vehicle (Veh) 

or LPS, in PBS and MIA offspring from Fig. 1c. Data expressed as per cent 
sociability (a), time spent investigating social (S) versus inanimate (I) objects (b), 
total interaction time (c), time spent in social (S), centre (C) or inanimate (1) 
chamber (d), and distance travelled (e) during three-chambered sociability 
experiments. PBS offspring + vehicle, n=7; PBS offspring + LPS, n=7; 

MIA offspring + vehicle, n= 8; MIA offspring + LPS, n=6; from 2 independent 
experiments. f-j, Sociability measured before and 4 hafter injection of vehicle 
or LPSin aged MIA mice (9-12 months old). Data are expressed as per cent 
sociability (f), time spent investigating social (S) versus inanimate (I) objects (g), 
total interaction time (h), time spent in social (S), centre (C) or inanimate (I) 


chamber (i), and distance travelled (j) during three-chambered sociability 
experiments. MIA offspring + vehicle, n= 6; MIA offspring + LPS, n=7; from 

2 independent experiments. k, Reciprocal social interactions measured after 
treatment with vehicle or LPS in PBS or MIA offspring. PBS offspring + vehicle, 
n=9;PBS offspring + LPS, n=9; MIA offspring + vehicle, n=11; MIA 

offspring + LPS, n=11; from 4 independent experiments. I, Marble burying 
index (percentage of buried marbles) measured before and 4h after treatment 
with vehicle or LPS in PBS or MIA offspring. PBS offspring + vehicle, n=12; PBS 
offspring + LPS, n=12; MIA offspring + vehicle, n=12; MIA offspring +LPS, 
n=11; from 5 independent experiments. All n values refer to the number of mice 
used. Statistics calculated by two-way ANOVA with Sidak’s (a-c, e, g), Dunnett’s 
(d, i) or Tukey’s (k) post-hoc tests, or two-way repeated-measures ANOVA with 
Sidak’s post-hoc test (f, h,j, l). Graphs are mean +s.e.m. 
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Extended Data Fig. 4| Acute increase in body temperature is insufficient 
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to promote sociability. a, Body temperature profile after injection of vehicle 
or LPSin MIA offspring. Vehicle, n=10; LPS, n=10; from 4 independent 
experiments. The initial spike in body temperature is due to handling stress. 
b-e, Data are expressed as time spent investigating social (S) versus inanimate 
(I) objects (b), total interaction time (c), time spent in social (S), centre (C) or 
inanimate (I) chamber (d), and distance travelled (e) during three-chambered 
sociability experiments described in Fig. 1g.n=9 for all groups, from 
2independent experiments. f-j, Sociability performance in Vgat—Cre PBS and 
MIA offspring after treatment with vehicle, CNO or LPS. Data are expressed as 


per cent sociability (f), time spent investigating social (S) versus inanimate (I) 
objects (g), total interaction time (h), time spent in social (S), centre (C) or 
inanimate (I) chamber (i), and distance travelled (j) during three-chambered 
sociability experiments. PBS offspring, n=11; MIA offspring, n=7; from 

2 independent experiments. All n values refer to the number of mice used. 
Statistics calculated by two-way repeated-measures ANOVA with Bonferroni's (a) 
or Dunnett’s (f, h,j) post-hoc tests, two-way ANOVA with Sidak’s (b, g) or 
Dunnett’s (d, i) post-hoc tests, or one-way repeated-measures ANOVA with 
Tukey’s post-hoc test (c, e). Graphs are mean +S.e.m. 


AP -0.46 mm AP -0.46 mm 


Extended Data Fig. 5 | Histological identification of the S1DZ. a, Coronal b, Coronal section of the cortex imaged with differential interference contrast, 
section of the cortex counterstained with DAPI to highlight the abrupt further highlighting the reduced layer 4 in the SIDZ at AP -0.46 mm. n=3, from 
reduction in cell density in layer 4, between the SIDZ and the SIBF, at 2 independent experiments. All n values refer to the number of mice used. 


AP-0.46mm.n=5, from 1lindependent experiment. D, dorsal; V, ventral. White arrows indicate borders of S1DZ. Scale bars, 500 xm (a); 300 pm (b). 
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Extended Data Fig. 6|See next page for caption. 
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Extended Data Fig. 6 | Treatment of MIA offspring with LPS does not havea 
distinguishable effect on FOS expression in the other cortical regions 
analysed.a, Full cortical depth of SIDZ FOS staining as shown in Fig. 2a, for 

PBS and MIA offspring after administration of vehicle or LPS. PBS 
offspring + vehicle, n= 8; PBS offspring + LPS, n=9; MIA offspring + vehicle, 
n=13;MIA offspring + LPS, n=11; from 3 independent experiments. Scale bar, 
200 um. b,c, Representative images (b) and quantification (c) of FOS (green) 
and NeuN (red) colabelled cells within the SIDZ of PBS and MIA offspring. PBS 
offspring, n=4; MIA offspring, n =3; from 1independent experiment. Scale bar, 
50 um.d,e, Representative images (d) and quantification (e) of FOS expression 
ina series of cortical regions and after injection of vehicle or LPS in PBS or MIA 


offspring. Sections are stained for FOS (green) and DAPI (blue). Scale bars, 
200 um. For SIBF, M2, Mland AuD: PBS offspring + vehicle, n= 8; PBS 
offspring + LPS, n=9; MIA offspring + vehicle, n=13; MIA offspring + LPS, n=11. 
For mPFC: PBS offspring + vehicle, n= 8; PBS offspring + LPS,n=9; MIA 
offspring + vehicle, n=12; MIA offspring +LPS, n=11. For V1: PBS 
offspring + vehicle, n=7; PBS offspring + LPS, n=8; MIA offspring + vehicle, 
n=12;MIA offspring + LPS, n=9; from 4 independent experiments. Alln values 
refer to the number of mice used. Statistics calculated by unpaired two-tailed 
t-test (c) or two-way ANOVA with Tukey’s post-hoc test (e). Graphs are 
mean+s.e.m. 
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Extended Data Fig. 7 | Further behavioural analyses of S1DZ optical- 
inhibition-mediated rescue of sociability in monogenic mutant mice. 

a, Quantification of FOS-expressing cells inthe SIDZ of monogenic 

mutant mice. Wild type, n= 6; Cntnap2 mutant, n= 21; Fmri1 mutant, n=17; 
Shank3 mutant, n=15; from 5 independent experiments. b, Correlation of FOS 
expression in the S1DZ with severity of sociability deficits across monogenic 
mutant mice. Cntnap mutant, n=21; Fmr1 mutant, n=17; Shank3 mutant, n=15; 
from 4 independent experiments. Black solid lines represent regression line; 
grey lines indicate 90% confidence intervals. c, Individual data for experiments 
in Fig. 2f. d-f, Data are expressed as time spent investigating social (S) versus 


inanimate (I) objects (d), total interaction time (e), and distance travelled (f) 
during the three-chambered sociability experiments described in Fig. 2f. Wild 
type + EYFP,n=7; wild type + NpHR, n= 8; Cntnap2 mutant, + EYFP, n=11; 
Cntnap2 mutant + NpHR, n=9; Fmr1 mutant + EYFP, n= 8; Fmrl mutant + NpHR, 
n=12; Shank3 mutant + EYFP, n= 8; Shank3 mutant + NpHR, n=10; from 

6 independent experiments. Alln values refer to the number of mice used. 
Statistics calculated by one-way ANOVA with Dunnett’s post-hoc test (a), linear 
regression (b), one-way repeated-measures ANOVA with Dunnet’s post-hoc 
test (c), two-way ANOVA with Sidak’s post-hoc test (d) or two-way repeated- 
measures ANOVA with Sidak’s post-hoc test (e, f). Graphs are mean + s.e.m. 
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Extended Data Fig. 8 | See next page for caption. 
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Extended Data Fig. 8 | //17ra expression in the SIDZ of PBS and MIA offspring 
and further behavioural analyses of SIDZIL-17a rescue of sociability in 

MIA offspring and monogenic mutant mice. a, Representative images of 
/l17ra expression in the S1DZ of PBS and MIA offspring. Scale bar, 1mm. 

b, Quantification of /[17ra expression within the S1DZ of MIA offspring 
according to cortical layer. n=6, from 2 independent experiments. 

c, Quantification of overall //17ra expression in the SIDZ of PBS and MIA 
offspring. PBS offspring, n=8; MIA offspring, n=6; from 2 independent 
experiments. d-h, Further behavioural analyses of experiments described in 
Fig. 3f. Time spent investigating social (S) versus inanimate (I) objects (e), total 


interaction time (f), time spent in social (S), centre (C) or inanimate (I) chamber (g), 
and distance travelled (h). PBS offspring + vehicle, n= 11; PBS offspring + IL-17a, 
n=12;MIA offspring + vehicle, n=14; MIA offspring + IL-17a, n=10; wild 

type + vehicle, n=11; wild type + IL-17a, n= 11; Cntnap2 mutant + vehicle, n=8; 
Cntnap2 mutant + IL-17a, n=10; Fmr1 mutant + vehicle, n=9; Fmr1 mutant +1L-17a, 
n=11; from 6 independent experiments. Alln values refer to the number of mice 
used. Statistics calculated by unpaired two-tailed t-test (c), two-way ANOVA 
with Sidak’s (e) or Dunnett’s (g) post-hoc tests, or two-way repeated-measures 
ANOVA with Sidak’s post-hoc test (f, h). Graphs are mean +s.e.m. 
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Extended Data Fig. 9 | IL-17ais necessary for LPS-induced behaviouralrescue vehicle or LPS in MIA offspring pre-treated intracerebroventricularly with 


and reduction of FOS expression in MIA offspring. a—e, Further behavioural isotype-control antibody or blocking antibody against IL-17a (anti-IL-17a). PBS 
analyses of experiments described in Fig. 4a. Time spent investigating social offspring + vehicle + isotype, n=14; MIA offspring + vehicle + isotype, n=9; MIA 
(S) versus inanimate (I) objects (b), total interaction time (c), time spentin offspring + LPS + isotype, n=10; MIA offspring + LPS + anti-IL-17a, n=10; from 
social (S), centre (C) or inanimate (I) chamber (d), and distance travelled (e).PBS | 4independent experiments. All nvalues refer to the number of mice used. 
offspring + vehicle + isotype, n=9; PBS offspring + LPS + isotype, n=11; MIA Statistics calculated by two-way ANOVA with Sidak’s (b) or Dunnett's (d) post- 
offspring + vehicle + isotype, n=10; MIA offspring + LPS + isotype, n=10; MIA hoc tests, two-way repeated-measures ANOVA with Sidak’s post-hoc tests (c, e) 
offspring + LPS + anti-IL-17a, n=10; from 7 independent experiments. or one-way ANOVA with Dunnett’s post-hoc test (f). Graphs are mean +s.e.m. 


f, Quantification of FOS-expressing cells inthe SIDZ and CeA after injection of 


Article 


a 


LPS + Isotype LPS + anti-IL-17a 
c 
101 “078° 
*. 
a 2 g B 10° eget 
ao = + > ws ® PBS-1 
2 : 8 Pi ° PBS-2 
oD D D -4 ‘ 
& & £ 10 
5 5 a § ae 
1 “ge ° g 
Fy 8 10 ent B 10 ra 
‘= = woe < af 
oy ~ ote ° ay vac? 
az 6 3 0 igeoe® 3 Py EH oe 
Ss 0 a 410 ons®* a 10 oe eee 
4%, : ae © MIA-1 
2 # i Tce mete e MIA-2 
; , ge —- ° e P \eaa e 
10" 40-1 10° 101 10°" 4971 40° 101 10°" 40-1 10° 101 
Pre-injection firing rate (Hz) Pre-injection firing rate (Hz) Pre-injection firing rate (Hz) 
d e Ls i: ee _ MIA g MIA 
———S= OS 
i.p. injection ra o8 3% et 
(LPS) ° 2 La) Log 


400 


a fo?) 
oO oO 
Oo [om 
| | 
fe) 
85 
fe} 
oo 
fo} 
| 


an 


MIA (IL-17Ra™) 


200 
200 - 


oO 


Interaction Time (s) 
oo 
& 
'O 
° 
Total Interaction Time (s) 
Time Spent (s) 
nN & to2] 
oO oO oo 
Oo [= oO oO 
| | ] 
aes 
fe} 
@ 
fe} 
fo} 
we) 
Oo oO 
P00 
Oo fe} 


oO 

rt 
” 
ie) 
no 
oO 
no 
oO 
no 
oO 


SI(/SIISI|S1] 
Pre | Test | Pre | Test 


£8 &3 Pre | Test | Pre | Test 
DIL-17Ra™,EYFP 
W/L-17Ra™:EGFP:nCre 
h i j MA | 
© 2 24000-— 
= e 48000 
2 a § 12000—- 
a & & e ad 
® NOR = 
Q 7, oe ‘6 =63000—- 
s = = < 
a 
Pat 1500— 
a IN7ra Cll | 
a ——— 
e Gapdh | es o- 


Pre 
Test 
Pre 
Test 
W7ra 
Gapdh 


Extended Data Fig. 10 | See next page for caption. 


Extended Data Fig. 10 | Further analyses of the necessity of IL-17a for the 
LPS-induced reduction of firing rate in the S1DZ, and the necessity of SIDZ 
IL-17Ra expression for the LPS-induced rescue of sociability deficits in MIA 
offspring. a—c, Further analyses for experiments described in Fig. 4b-d. 

a, Example of a head-fixed mouse on the running wheel used during single-unit 
recording. b, Representative image of atetrode placement inthe S1DZ. Scale 
bar,500 pm.c, Firing rate for individual cells before and 4 h after injection of 
vehicle or LPS in PBS and MIA offspring pre-treated with isotype-control 
antibody or blocking antibody against IL-17a (anti-IL-17a). d-h, Further analyses 
for experiments described in Fig. 4e, f. Time spent investigating social (S) 


versus inanimate (I) objects (e), total interaction time (f), time spent in social 
(S), centre (C) or inanimate (1) chambers (g), and distance travelled (h). /-17ra™; 
EYFP,n=9; Il-17ra"";EGFP:nCre, n= 10; from5 independent experiments. 

i,j, Representative images (i) and corresponding quantification (j) of //17ra and 
Gapdh amplicon following PCR using cDNA derived from cells isolated from the 
cortical region centred on S1DZ of Il-17ra™";EYFP and Il-17ra"";EGFP:nCre mice. 
n=3 for both groups; from1 experiment. All n values refer to the number of 
mice used. Statistics calculated by two-way ANOVA with Sidak’s post-hoc test 
(e, g), two-way repeated-measures ANOVA with Sidak’s post-hoc tests, (f, h) or 
unpaired two-tailed ¢-test (j). Graphs are mean +s.e.m. 
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With ageing, normal human tissues experience an expansion of somatic clones that 
carry cancer mutations’ ’. However, whether such clonal expansion exists in the non- 
neoplastic intestine remains unknown. Here, using whole-exome sequencing data 
from 76 clonal human colon organoids, we identify a unique pattern of somatic 
mutagenesis in the inflamed epithelium of patients with ulcerative colitis. The 
affected epithelium accumulates somatic mutations in multiple genes that are related 
to IL-17 signalling—including NFKBIZ, ZC3H12A and PIGR, which are genes that are 
rarely affected in colon cancer. Targeted sequencing validates the pervasive spread of 
mutations that are related to IL-17 signalling. Unbiased CRISPR-based knockout 
screening in colon organoids reveals that the mutations confer resistance to the pro- 
apoptotic response that is induced by IL-17A. Some of these genetic mutations are 


known to exacerbate experimental colitis in mice 


8-1 and somatic mutagenesis in 


human colon epithelium may be causally linked to the inflammatory process. Our 
findings highlight a genetic landscape that adapts to a hostile microenvironment, and 
demonstrate its potential contribution to the pathogenesis of ulcerative colitis. 


Most tissues in our body constantly self-renew to maintain homeostasis. 
During repetitive cycles of self-renewal, somatic cells inevitably acquire 
genetic mutations®. Sequencing efforts have identified such somatic 
mutations in normal tissues at a clonal or near-clonal level’. Notably, 
these mutated genes often overlap with the driver genes of tumours 
that originate from the same tissues®. Genetic perturbation studies 
also support the concept that the acquisition of tissue-specific somatic 
mutations, or ‘somatic evolution’, confers cell-autonomous self-renewal 
capability and contributes to tumorigenesis. Given the importance of 
adaptation that is driven by the environment in Darwinian evolution, 
somatic evolution may also occur ina non-cell-autonomous manner as 
aconsequence of continuous pressure from the hostile tissue environ- 
ment. However, the biological intractability of mutant clones in normal 
tissues renders this question challenging to address. Recurrent muta- 
tions have only been captured as sequencing snapshots, and deriving 
living mutant clones has remained difficult owing to their rarity. 
Ulcerative colitis (UC) is an idiopathic chronic inflammatory bowel 
disease that is characterized by continuous inflammation starting from 
the rectum. Long-standing inflammation predisposes patients with UC 
to colorectal cancer, which suggests that somatic evolution occurs in 
UC. Sporadic emergence of 7P53-mutant clones in the non-neoplastic 
UC epithelium supports this hypothesis”. However, whether the UC 
epithelium reorganizes its genetic landscape in parallel with chronic 
inflammatory exposure has not yet been investigated. To resolve this 
issue, we here obtain a panoramic view of the genetic mutations in 


the UC epithelium using an organoid-based analysis of clonal somatic 
mutations. 


Somatic mutations in the UC epithelium 


To determine the mutagenic effect of chronic inflammation onnormal 
colorectal epithelium, we established non-neoplastic colorectal orga- 
noids from patients with UC, including patients with colitis-associated 
neoplasia (CAN), and from healthy control individuals. We prioritized 
organoids that were derived from patients with left-sided UC (that is, UC 
that involves the colonic segment distal to the splenic flexure) to obtain 
uninflamed (UC!™") and inflamed (UC'") tissue-derived organoids from 
the same donors. Overall, 170 normal organoid lines representing 55 
patients with UC and 16 healthy control individuals (Fig. 1, Extended 
Data Fig. 1a, b, Extended Data Table 1) were generated. Out of 55 patients 
with UC, 26 had concomitant CAN. We also established 18 CAN organoid 
lines. To capture inflammation-related mutations at the clonal level, 
we subsequently expanded clonal organoids from single cells from 
patients with UC and healthy control individuals, and subjected the 
organoids to whole-exome sequencing. The number of single nucleo- 
tide variants (SNVs) in the epithelium from healthy control individuals 
gradually increased with age (0.019 per Mb per year; 95% confidence 
interval 0.004-0.033; P= 0.017), inagreement witha previous report’ 
(Fig. 1b). Because the mutational burden in tissues affected by UC was 
expected to reflect cumulative inflammation rather than disease activity 
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Fig. 1| Somatic mutations in UC epithelium. a, Strategy for clonal detection of 
somatic mutations in UC epithelium. Specimens were obtained from healthy 
control individuals (HC), patients with UC without CAN and patients with UC 
with CAN. Thennumbers inthe top rowtop refer to the number of patients; 
those in the bottom two rows refer to the number of organoid lines. b, The 
relationship between the number of SNVs and age (left, middle) or duration of 
disease (right) in healthy control (n=28), UC!" (n=15), UC" without CAN 


at the point of sampling, we divided UC" organoids on the basis of CAN 
concomitance to designate organoids witha history of long-term active 
inflammation. The rate of mutation did not significantly vary among the 
groupsinalinear mixed model that incorporated age and disease status 
as fixed effects (Fig. 1b). The number of SNVs was also not correlated 
with endoscopic severity at the time of sampling. We next assessed the 
effect of disease duration, as this factor varies among patients with 
UC. The SNV count in UC" organoids that were derived from patients 
with CAN was positively correlated with disease duration (0.10 per Mb 
per year; 95% confidence interval 0.06-0.14; P= 0.0017) (Fig. 1b). This 
mutation rate was significantly higher than that in UC" organoids, 
even if the interaction between age and disease duration was considered 
(P=0.0032). These results suggest that chronic inflammation induced 
genetic mutations independently of ageing, but that other factors not 
related to inflammation—such as patient medication and the inherent 
genetic mutability of the host—may have confounded the mutational 
loads. To exclude such potential bias, we compared SNV counts between 
paired UC and UC" organoids that were derived from patients with 
left-sided UC. This intra-patient comparison demonstrated that the 
number of SNVs in UC'" organoids was nominally higher than that in 
Uc!" organoids, whereas SNV counts in organoids from healthy control 
individuals were comparable between the right and left colons (Fig. 1c). 
Wenext determined whether UC inflammation is related to specific muta- 
tional signatures”. Regardless of inflammation, the organoids acquired 
ageing-associated nucleotide transitions (Extended Data Fig. 2a). Despite 
being derived fromanon-neoplastic tissue, UCO8 organoids contained an 
MLH1I™™ mutation and showed mutational signatures that are associated 
witha deficiency in mismatch repair. UC10T_IS organoids also had APOBEC 
mutational signatures (Extended Data Fig. 2b). Apart from these lines, an 
association between specific mutational signatures and inflammation was 
not evident (Extended Data Fig. 2b). Collectively, these results suggest that 
UC inflammation produces a modest increase in the overall number of 
mutations, without a bias towards specific mutational signatures. 


Genomic landscape of the UC epithelium 


To investigate whether functional genetic mutations were enriched 
in UC" organoids, we focused on truncating mutations, including 


(n=15) and UC" with CAN (n=15) samples. Dotted lines show linear regression 
as determined by the linear mixed model. c, Intra-patient comparison of 
numbers of SNVs between the left and right colons in healthy control 
individuals (n =12) and patients with left-sided UC (n=14). Pvalues determined 
by two-sided Wilcoxon signed-rank test. d, Genetic mutations identified in 
organoid clones. UC organoids that were derived from patients with CAN are 
labelled in black. Hetero, heterozygous. 


nonsense, frame-shift and splice site mutations. In contrast to healthy 
organoids, UC'™ organoids acquired recurrent truncating mutations 
in two genes that are associated with mucosal immunity—NFKB/Z and 
PIGR (Fig. 1d). Five UC'* organoid clones acquired biallelic NFKBIZ 
or PIGR truncating mutations, which highlights the inflammation- 
specific selection of these mutations (Fig. 1d). The gross number of 
NFKBiZ truncating mutations in UC™ organoids was significantly higher 
compared to other truncating mutations, even in this small sample 
size (Extended Data Fig. 2c). Furthermore, SNVs in NFKBIZ and PIGR 
showed high dN/dS ratios (the ratio of non-synonymous to synony- 
mous substitutions)", demonstrating a bias towards non-synonymous, 
protein-damaging mutations over neutral synonymous mutations 
(Extended Data Fig. 2d). In two UC" organoids, truncating mutations 
in NFKBIZ and TRAF3/P2 were accompanied by loss of heterozygosity 
(LOH; Fig. 1d). Whole-exome sequencing data also identified LOH in 7 
out of 31 UC" organoids, which predominantly spanned recurrently 
mutated loci: chromosomes 1q (PIGR), 3q (NFKBIZ) and 6q (TRAF3/P2) 
(Extended Data Fig. 3a). ADNA copy-number analysis confirmed pre- 
dicted chromosomal losses in two out of seven UC'™ organoids with 
LOH (Extended Data Fig. 3b). These two clones exhibited aneuploidy, 
whereas LOH regions in the other five euploid lines showed copy-neu- 
tral LOH (CN-LOH) (Extended Data Fig. 3b). One UC line (UC12T_IA), 
which had CN-LOH in the P/GR locus without a somatic mutation inthe 
PIGR coding region, lacked expression of the PIGR protein—emphasiz- 
ing the role of LOH in genetic disruption (Extended Data Fig. 3c). Wealso 
detected focal deletions at fragile sites exclusively in UC'™ organoids 
(Extended Data Fig. 3d). These results indicate that the UC epithelium 
is prone to genomic instability, and that this eventually contributes to 
the formation of a unique genomic landscape. 


Recurrent IL-17 pathway mutations in UC 

We next investigated biological pathways that are affected by UC- 
related mutations. NFKBIZ encodes IkBZ, which is upregulated by NF-KB 
signal activation and enhances the activation of NF-kB by regulating 
positive feedback’*”’. PIGR is a cell-surface molecule that regulates 
transcytosis and the secretion of dimeric immunoglobulin A (IgA)”. 
Previous studies have revealed that PIGR is upregulated by IL-17-related 
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Fig. 2| Targeted sequencing of bulk UC organoids. a, The IL-17 signalling 
pathway. Genes that are mutated in UC are in red. b, IL-17A-responsive genes in 
normal colon organoids (n =3). Genes with P< 0.01 (empirical Bayes method) 
are shown inred. Bold text indicates genes that are mutated in UC. c, Genes that 
are differentially expressed between purified UC (n=11) and healthy (n=11) 
epithelium (E-MTAB-5464). Genes with adjusted P< 0.01 (exact negative 
binomial test followed by the Benjamin-Hochberg procedure) are shownin 
red. Bold text indicates genes that are mutated in UC. CPM, counts per million. 
d, Genetic mutations detected in UC" organoids by targeted sequencing. Top, 
for each gene, the number of different variants is shown by density. Bottom, 


activation of NF-KB!®!”, TRAF3IP2 (also known as ACTI) serves as an 
adaptor of the IL-17 receptor and is essential for downstream activation 
of NF-KB signalling”. Recurrently mutated genes in the inflamed UC 
epithelium therefore appear to converge on the IL-17-NF-kB signalling 
pathway (Fig. 2a). In support of this, IL-17A stimulation upregulated 
NFKBIZ, PIGR and other canonical targets of IL-17A, including CXCL1 and 
LCN2, in colonic organoids (Fig. 2b). A reanalysis of published datasets 
revealed upregulation of NFKBIZ mRNA and other IL-17A target genes 
in tissues with UC (Fig. 2c, Extended Data Fig. 4a, b). Together, these 
results suggest that IL-17 signal activation upregulates the expression 
of IkB¢ and PIGR in the UC epithelium. 

To determine whether IkBZis a critical regulator of IL-17 signal activa- 
tion inthe colonic epithelium, we treated wild-type, NFKB/Z-mutant and 
TRAF3IP2-mutant UC organoids with IL-17A. PIGR was not upregulated 
in NFKBIZ- and TRAF3/P2-mutant organoids that were treated with IL-17A 
(Extended Data Fig. 4c-h), highlighting the convergence of mutations 
in the IL-17 and IkBC signalling pathways during UC-related inflamma- 
tion. Of note, mutations in NFKB/Zand PIGR were rare or heterozygous 
in CANs in our cohort (Extended Data Fig. 4i)—as well as in previous 
exome-sequencing analyses”! **—which suggests that alteration of 
these genes is unlikely to initiate colitis-associated tumorigenesis. 


The mutational spectrum in UC epithelium 

To assess the prevalence of UC-associated mutations, we subjected 
short-term cultures of epithelium-enriched polyclonal UC'™, UC2™"" 
and control organoids to targeted sequencing of 37 genes that com- 
prise components of the IL-17 signalling pathway and sporadic drivers 
of colon cancer (Fig. 2a, Supplementary Table 2). The analysis con- 
firmed that NFKB/Zand PIGR mutations were frequent in UC™ organoids 
(Fig. 2d, Supplementary Table 2). Variant-allele frequencies (VAFs) 
were similar between the original epithelium tissues and the orga- 
noids, which suggests that selection bias was minimal during organoid 


256 | Nature | Vol577 | 9 January 2020 


dots represent the VAF of each variant. Genes are shown in different colours as 
specified on the left. e, PIGR (top) and secretory IgA (sIgA) (bottom) 
immunostaining of a UC" tissue. A P/GR-mutant organoid line (UC12T_IT) was 
derived from the adjacent region. Magnified areas show boundaries between 
regions in which PIGR expression is intact and regions in which PIGR expression 
is lost. Representative result from n=8 patients analysed. Scale bars, 500 um 
(low magnification); 100 pm (high magnification). f, ZC3H12A mutations 
identified by targeted sequencing. All mutations in UC organoids were located 
in the last exon (black). fs, frameshift; *, nonsense mutation. 


derivation (Extended Data Fig. 5a). In three UC'" organoids, we iden- 
tified five /L17RA mutations that were not detected by whole-exome 
sequencing. Multiple disparate truncating mutations often co-existed 
inatiny biopsy sample, with P/GR, ARIDIA and NFKB/Z mutations being 
prevalent. The high dN/dS ratios of these mutations indicated subse- 
quent functional effects (Extended Data Fig. 5b). Targeted sequencing 
also detected UC-associated mutations in UC?" organoids, but the 
medical records of the respective patients revealed a past history of 
inflammation in most of the regions that were sampled—suggesting 
that previous inflammation has a role in the selection of UC-associated 
mutations (Extended Data Fig. Sc). 

On the basis of the high frequency of mutations in PIGR, we next 
examined the expression of PIGR in the colonic tissues adjacent to the 
sampled sites. PIGR was ubiquitously expressed in healthy and UC" 
epithelium, whereas UC" tissues exhibited a focal loss of PIGR that was 
attributable to PIGR mutations (Fig. 2e, Extended Data Fig. 5d). The 
concomitant depletion of secretory IgA in these regions reinforced 
the functional loss of PIGR (Fig. 2e, Extended Data Fig. 5d). Notably, 
several studies from the 1970s and 1980s documented idiopathic loss 
of epithelial IgA and PIGR in patients with UC? ~”. The frequent somatic 
PIGR mutations could constitute a previously unknown mechanism for 
the sporadic disappearance of the IgA system in the UC epithelium. 

Target sequencing also identified a ZC3H12A°**" mutation in seven 
UC'™ organoids. ZC3H12A, which is also known as Regnase-l, is an 
IL-17-responsive RNA-binding protein that dismantles the mRNA of 
inflammation-related genes, including NFKBIZ and IL17RA”*.. Phos- 
phorylation of serine residues in its DSGXXS phospho-degron motif, 
in which $438 is situated (Fig. 2f), recruits the ubiquitin machinery 
to degrade the ZC3HI12A protein. As the S438L mutation blocks this 
phosphorylation process”, this acts as a gain-of-function mutationin 
ZC3H12A and results in the stabilization of the protein. One UC" orga- 
noid line carried aZC3H12A°“” mutation that affects the DSGXXS motif 
(Fig. 2f), and this organoid line exhibited a lower response to IL-17A than 
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Fig. 3 | Phenotypic characterization of UC-related mutations. a, Strategy for 
screening for genetic mutations that confer IL-17A resistance. b, Enrichment of 
sgRNA-targeted organoids after four weeks of treatment with IL-17A. Fold- 
change values relative to untreated organoids are shown. Each dot 
corresponds to onesgRNA sequence. Bars represent mean fold-change values. 
Data are representative of n = 2 biologically independent experiments with 
similar results. Pvalues determined by two-sided Mann-Whitney Utest.c, 


did patient-matched wild-type ZC3H12A organoids (Extended Data 
Fig. 6a, b). We also generated a ZC3H12A°*” knock-in (KI) line of orga- 
noids (ZC3H12A°*”""') and found that this showed a similar response 
to IL-17A (Extended Data Fig. 6c-e), verifying the gain of function that 
is conferred by the D437Y mutation. These results suggest that gain- 
of-function hotspot mutations, in analogy to oncogenic mutations in 
cancers, can occur innon-dysplastic human epithelium. 

To exclude the potential overrepresentation of specific mutations 
owing to our use of multiple clones derived from the same patients, we 
reanalysed the targeted sequencing data by selecting the most-distal 
sample for each patient. This per-patient analysis indicated that around 
65% of the patients with UC and concomitant CAN had at least one of 
the IL-17-signalling-related mutations in an area of epithelium of less 
than 5 mm? (Extended Data Fig. 6f). 

Given that NFKBIZ, ZC3H12A and PIGR could be upregulated by 
cytokines other thanIL-17A, we comprehensively examined mutations 
in cytokine and pathogen-associated molecular pattern (PAMP) recep- 
tor genes by targeted sequencing. Although none of the 76 selected 
cytokine or PAMP receptor genes was mutated in more than two sam- 
ples, several truncating mutations were identified (Extended Data 
Fig. 7a). Of the genes with truncating mutations, OSMR, ILISRA and 
TLR3 were found to be expressed at substantial levels in the colonic 
epithelium®°. Nonetheless, genetic analysis ona larger scale will be 
required to confirm their enrichment during inflammation. 

Targeted sequencing also detected sporadic colon cancer-related 
mutations in both UC and control epithelium, with higher incidence 
in the UC epithelium (Extended Data Fig. 7b, c). Cancer-associated 
mutations that were identified in UC organoids were also present 
in CAC organoids, which suggests that these mutations have a selec- 
tion advantage under conditions of chronic inflammation (Extended 
Data Fig. 7d). 


IL-17A cytotoxicity in colonic epithelium 


The frequent mutations that occur in the IL-17 signalling pathway sug- 
gested the presence of a mutation-driven selection bias during a pro- 
longed period of inflammation. To determine whether IL-17 signalling 
elicits cytotoxic effects in the colonic epithelium, we treated colonic 
organoids withIL-17A. Contrary to our expectation, a week-long treat- 
ment with IL-17A marginally affected organoid growth, which ham- 
pered the functional assessment of UC-associated mutations. We thus 
turned to an agnostic CRISPR-based knockout screening (Fig. 3a). This 
approach sensitively identified genes the loss of which confers a sur- 
vival advantage during IL-17A selection, as demonstrated by an enrich- 
ment of /L17RA and NFKBIZ single-guide (sg)RNAs. (Fig. 3b, Extended 
Data Fig. 8a). Of note, PIGR sgRNAs were not enriched, which suggests 
that growth selection by P/GR mutations was not mediated directly 
by IL-17A cytotoxicity but requires an IgA-related immune response. 


targeting 


Sensitization of organoids to IL-17A by withdrawal of Noggin, and its reversal by 
pre-treating organoids with a neutralizing anti-IL-17A antibody (Ab). Right, dots 
show the relative area of organoids in each well. WT, wild type. Data are 
representative of n=3 biologically independent experiments with similar 
results, and are shown as mean + s.e.m. Pvalue determined by two-sided 
Welch’s t-test. Scale bars, 1mm. 


We next asked whether specific medium conditions evoke IL-17A-me- 
diated organoid death. Screening of growth factor conditions demon- 
strated that Noggin deficiency sensitizes organoids to treatment with 
IL-17A (Extended Data Fig. 8b). An anti-IL-17A neutralizing antibody 
or genetic disruption of /L17RA counteracted this cytotoxic response 
(Fig. 3c, Extended Data Fig. 8c, d). Both NFKBIZ knockout (KO) and 
patient-derived NFKB/Z-mutant organoid lines exhibited resistance 
to treatment with IL-17A (Extended Data Fig. 8c, e, f). Furthermore, 
TRAF3IP2© and ZC3H12A°*”’ organoids showed a similar response 
(Extended Data Fig. 8c, d, g). TP53° organoids also tolerated IL-17A 
stimulation (Extended Data Fig. 8h), which may account for the emer- 
gence of 7P53-mutant clones in non-neoplastic UC epithelium and 
nascent CANs. These results together suggest that IL-17A is detrimental 
for the epithelial lining, and that clones withIL-17A pathway mutations 
are able to avert IL-17A-mediated cytotoxicity and thereby selectively 
expand in tissue that is affected by UC. 


Mechanism of IL-17A-induced cytotoxicity 


Anincrease in cells that express cleaved caspase-3 in wild-type, but not 
NFKBIZ®, organoids as a result of treatment with IL-17A indicated an 
activation of the apoptotic pathway (Fig. 4a, Extended Data Fig. 9a). 
Transcriptome analysis revealed a biased induction of NOS2 (iNOS, 
inducible NO synthase) in a Noggin-deficient condition (Fig. 4c), sug- 
gesting that NOS2 has a role in IL-17A-induced apoptosis. Treatment 
withiNOS inhibitors suppressed IL-17A-induced growth arrest (Fig. 4d). 
IL-17A stimulation failed to induce NOS2 expression in NFKBIZ©, 
TRAF3IP2” and IL17RA® organoids (Extended Data Fig. 9b), consistent 
with their resistance to IL-17A. Of note, targeted sequencing detected 
one NOS2truncating mutationin UC organoids (Fig. 2d).Inagreement 
with previous studies” *, iNOS was highly expressed at the mucosal 
surface of UC" tissue (where BMP signalling is active), but its expres- 
sion was scarce in healthy tissue (Fig. 4f). Together, these data sug- 
gest that in tissue with UC, IL-17A transmits iNOS- and NO-dependent 
pro-apoptotic signals to the superficial layer of the epithelium and 
subsequently drives the selective expansion of IL-17A-resistant mutant 
clones (Fig. 4g). 


Discussion 

In contrast to the pervasive spread of cancer mutations in the haema- 
topoietic system’”, skin® and the epithelium of the oesophagus”, the 
UC epithelium accrues tumour-irrelevant mutations that are presumed 
to facilitate selective expansion of these cells under inflammatory con- 
ditions. Using organoid culture, we pinpointed IL-17 signalling as one 
of the key pathways that drives this somatic evolution. IL-17A elicited 
a pro-apoptotic response in normal colonic organoids, and loss-of- 
function mutations in positive regulators of IL-17 signalling—namely 
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Fig. 4 iNOS mediates IL-17A-induced apoptosis. a, Expression of cleaved 
caspase-3 inIL-17A-treated wild-type and NFKBIZ organoids. Images are 
representative of n=2 technical replicates with similar results. Scale bars, 

100 pm. b, Expression of IL-17A target genes after IL-17A stimulation in the 
absence or presence of Noggin. The colour scale shows the fold change in gene 
expression that is induced by Noggin withdrawal in the presence of IL-17A.c, 
The iNOS inhibitors LNAME and 1400Wattenuate IL-17A-induced cytotoxicity. 


NFKBIZ, IL17RA and TRAF3IP2—were enriched in the UC epithelium. 
Unlike these mutations, alterations in P/GR did not translate into pro- 
tection from IL-17A-mediated cytotoxicity. Although IL-17A induced 
the expression of PIGR in organoids, high levels of PIGR were pre- 
sent in the colonic epithelium independently of IL-17A stimulation, 
suggesting that there is room for debate on the convergence of IL-17 
signalling mutations. The system that we used for organoid culture 
lacks IgA and related components of the immune system, which makes 
it challenging to investigate the effect of PIGR mutations during the 
selection process. 

By contrast with the reduction of the IL-17A-induced pro-apoptotic 
response that was observed here in human organoids with UC-related 
mutations, loss-of-function mutations of /l17ra®, Pigr’ or Traf3ip2” (and 
gain-of-function of Zc3h12a") exacerbate experimental colitis in geneti- 
cally engineered mice. IL-17 signalling was associated with mucosal 
integrity’°. Disruption of the IL-17-PIGR pathway provoked dysbiosis 
and intensified the host immune response®’. Furthermore, a previous 
clinical trial that aimed to test the efficacy of an anti-IL-17A antibody for 
the treatment of inflammatory bowel disease was prematurely discon- 
tinued asa result of the aggravation of colitis in trial participants™*. It 
is thus tempting to speculate that sporadic UC-associated mutations 
confer alocal survival advantage under chronic inflammation, whereas 
extensive expansion of mutant clones may paradoxically worsen the 
activity of UC. Notably, some of the somatically mutated genes in the 
UC epithelium are also associated with susceptibility to UC—including 
NFKBIZ(1s616597), TRAF3IP2 (rs3851228), PIGR (ts3024505) and NOS2 
(rs2945412)*—which suggests that altered function of these genes 
may also contribute to the onset of UC. Further studies are warranted 
to clarify how these UC-related genes pertain to disease aetiology and 
pathophysiology on a regional, tissue-wide and systemic level. 

In conclusion, our genetic profiling of UC organoids has unveiled 
a pattern of somatic evolution during chronic tissue inflammation. 
Our findings provide a blueprint for how clones that are adapted to 
an inflammatory microenvironment emerge and expand ina normal 
tissue, and will prompt future research onthe role of such clones inthe 
pathology of human diseases. 
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Right, dots show the relative area of organoids in each well. Data are shownas 
mean +s.e.m. Pvalue determined by two-sided Welch’s t-test. Scale bars, 1mm. 
d, iNOS immunostaining in healthy control and UC tissues. Representative 
images fromn=3 (HC) and n=6 (UC) samples with similar staining patterns. 
Scale bars, 100 pm. e, Proposed role of UC-associated somatic mutations in 
resistance to IL-17A-induced apoptosis. 
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Methods 


Data reporting 

No statistical methods were used to predetermine sample size. The 
experiments were not randomized and the investigators were not 
blinded to allocation during experiments and outcome assessment. 


Patient enrolment and tissue collection 

Human intestinal samples were obtained from healthy control individu- 
als or patients with UC who underwent colonoscopy or surgery at Keio 
University Hospital or Tokyo University Hospital, with written informed 
consent under approval of the ethical committees (approval numbers: 
20120057, 20130512, 20170170, 20170171 and G3553-(7)). UC was classi- 
fied into proctitis, left-sided colitis or extensive colitis according to the 
standard clinical criteria®’, and disease distribution was determined by 
macroscopic or endoscopic diagnosis at the time of tissue sampling. 
Whenever possible, multiple mucosal tissues were sampled from dif- 
fering anatomical segments of the same patient. CAN specimens were 
obtained in parallel with normal counterparts from patients who were 
histologically diagnosed with CAN. Normal tissues of patients with CAN 
were sampled from macroscopically normal regions with gross margins 
of at least 33cm from the tumours. Collected tissues were transferred into 
ice-cold phosphate-buffered saline (PBS) and placed on ice until use. 
Reference blood samples were obtained from all patients. Patient demo- 
graphics and sample information are available in Extended Data Table 1. 


Crypt isolation and organoid culture 
Crypt isolation and organoid culture from samples of human large 
intestine were carried out as previously described?” following dif- 
ferent procedures depending on sample types. For healthy control 
samples, the stroma was removed using fine scissors and the remain- 
ing epithelium was shredded into 1-mm?’ pieces. Epithelial fragments 
were washed vigorously at least five times with ice-cold PBS until the 
supernatant was free of debris and were treated with 2.5 mM EDTA 
(Thermo Fisher Scientific) for 1h at 4 °C with gentle rocking to release 
crypts. Liberated crypts were suspended in Matrigel and dispensed 
onto 48-well plates as 25-1l droplets. After Matrigel polymerization, 
crypts were overlaid with the medium as described below. Advanced 
Dulbecco’s modified Eagle’s medium/F12 (Thermo Fisher Scientific) 
was supplemented with penicillin and streptomycin (Thermo Fisher 
Scientific), 10 mM HEPES (Thermo Fisher Scientific), 2 mM GlutaMAX 
(Thermo Fisher Scientific), 1 x B-27 supplement (Thermo Fisher Scien- 
tific), 10 nM gastrin I (Sigma-Aldrich), 1 mM N-acetylcysteine (Sigma- 
Aldrich), 100 ng ml‘ recombinant mouse Noggin (PeproTech), 50 ng 
ml recombinant mouse EGF (Thermo Fisher Scientific), 100 ng mI 
recombinant human IGF-1 (BioLegend), 50 ng mI recombinant 
human FGF-basic (FGF-2) (Peprotech), 1 pg ml recombinant human 
R-spondin-1 (R&D), 500 nM A83-01 (Tocris) and 20% Afamin/WNT3A 
serum-free conditioned medium”. For UC", UC" and CAN samples, 
fragmented mucosal tissues were digested in Liberase TH Research 
Grade (Roche) for 1h at 37 °C. The remaining fragments were further 
dissociated in TrypLE Express (Thermo Fisher Scientific) for 20 min 
at 37 °C. For some samples, half of the processed tissue was spared 
and used for targeted sequencing. Digested tissue was embedded in 
Matrigel and, regardless of the sample type and inflammation sta- 
tus, maintained in the unified culture medium described above. The 
medium was refreshed every two days and the organoids were passaged 
oncea week witha split ratio of 1:10 using TrypLE Express. The medium 
was supplemented with 10 LM Y-27632 (FUJIFILM Wako Pure Chemical 
Corporation) fortwo days after establishment and passaging, and 5 uM 
CHIR99021 (Cayman Chemical) for seven days after establishment. 
To evaluate the effect of IL-17A treatment on organoid growth ina 
Noggin-deficient condition, the organoids were single-cell-passaged 
andallowed to growin the absence of Noggin with or without 100 ng mI 
IL-17A. The organoids were cultured for 6-8 days. For iNOS blockade, 


organoids were treated an IL-17A neutralizing antibody (Secukinumab, 
100 nM) from passaging, or with L-NAME (1 mM) or 1400W (500 mM) 
during IL-17A stimulation. The images of each well were captured using 
aBZ-X800 digital microscope (Keyence). The organoid area was meas- 
ured using BX-X800 Analyzer software (Keyence). 


Exome sequencing 

DNA was extracted from organoids or blood with the QlAamp Blood 
Mini Kit (Qiagen) and treated with RNase Cocktail (Ambion). The quality 
of DNA samples was confirmed by agarose gel electrophoresis within 
the expected range of DNA fragmentation. Three micrograms of 
genomic DNA was fragmented for whole-exome sequencing analyses 
as previously described*°. Whole-exome sequencing libraries were gen- 
erated by the SureSelect Human All Exon VS or V6 kit (Agilent) according 
to the manufacturer’s instructions and enriched exon fragments were 
sequenced by Illumina HiSeq2500 (outsourced to Hokkaido System 
Science) or HiSeq4000 (outsourced to BGIJAPAN K.K.) in 150-bp paired- 
end mode. Cleaned fastq files were mapped onto the human reference 
genome version GRCh37 (hg19) using Burrows-Wheeler Aligner (BWA, 
v.0.7.8). Generated reads were passed to the Genome Analysis Toolkit 
(GATK, v.1.6.13). SNVs, insertions and deletions were called using the 
UnifiedGenotyper tool in GATK. Variants were annotated using SnpEff 
(v.3). For analysis of numbers of mutations, raw variants were filtered 
by removing SNVs that were detected in the paired blood samples 
with VAF > 0.03. The filtered variants in autosomal chromosomes 
were counted. UC samples with a MLHI1 mutation were excluded from 
the analysis. UCO6_IS were also excluded owing to the use of different 
versions of exon-capture kits between organoids and reference blood 
samples. A linear regression model was fitted using the Ime functionin 
the R package nlme (v.3.1) to estimate the effect of age, disease dura- 
tion and group on the number of SNVs. For detection of mutations, 
raw variants were filtered by removing those registered in the SNV 
databases (dbSNP build 130 or lower), in the 1OOO Genome Project 
witha frequency of >0.05 or inthe Japanese SNP database (the Human 
Genetic Variation Database (HGVD, release v.2.30). Filtered data were 
used to calculate dN/dS ratios using the R package dndscv (v.0.0.1.0). 
The truncating mutations in the filtered data were assessed by Fisher’s 
exact test. Somatic mutations identified by whole-exome sequencing 
and dndscv results are available in Supplementary Table 1. 


Mutational signatures 

The nucleotide variants that were used for mutation-number analysis 
were also subjected to an analysis of mutational signatures. In this 
analysis, we included organoids with MLHI mutations (UCO8T_IL1, 
2). Using the R package MutationalPatterns (v.1.8.0), the occurrence 
of six nucleotide changes was counted for each group (UC, UC!""*, 
Uci" without CAN, UC with CAN), and the cosine similarity“ to the 
COSMIC signatures” was calculated. Hierarchical clustering was per- 
formed using Euclidean distance and the complete linkage method. 


Copy-number analysis 

For copy-number analysis, 500 ng of genomic DNA was applied to com- 
parative genomic hybridization (CGH)-SNP microarray (CytoScan HD, 
Affymetrix), according to the manufacturer’s instructions. We used 
‘apt-copynumber-cyto-ssa’ in Affymetrix Power Tools (v.1.19.0) with 
the normal-diploid-normalization option to obtain probe-level copy 
number and allele difference. All parameters were set according tothe 
recommendation in user manual. Segmentation analysis was performed 
with the segmentCGH function in the R Bioconductor package rCGH 
(v.1.8.1) using log,-transformed ratio values of copy-number probes. 


Gene expression microarray analysis and gene set enrichment 
analysis 

Three independent normal colonic organoid lines that were established 
previously as counterparts of colon cancer organoids* were used for 


gene expression microarray analysis. The organoids were cultured from 
single cells for five days and then treated with or without 100 ng mI“ IL- 
17A for 24 h. Two of these organoid lines were also used to evaluate the 
effect of Noggin withdrawal on IL-17A response. Inthis experiment, the 
organoids were cultured in Noggin-included or Noggin-deficient condi- 
tions for five days and IL-17A was added 24 h before RNA collection. RNA 
was extracted from organoids using the RNeasy Plus Mini Kit (Qiagen). 
RNA quality was measured using Bioanalyzer (Agilent), and all samples 
passed the quality check (RNA integrity number > 7.0). Gene expression 
profiling was performed using the PrimeView Human Gene Expression 
Array (Affymetrix) according to the manufacturer’s instructions. Raw 
expression data were normalized using the rma function in the affy R 
Bioconductor package (v.1.56.0). Linear model fitting and the empirical 
Bayes method implemented in the R/Bioconductor package limma” 
was used to compare IL-17A-treated and -untreated samples ina pairwise 
fashion. Forty-nine genes with P values <0.05 and fold-change values 
>1.5 were selected as in-house IL-17A target genes (Supplementary 
Table 3). Gene set enrichment analysis was performed using this IL-17A 
target gene set and public datasets with data on gene expression in UC- 
affected mucosa (GSE38713* and GSE59071"*). Enrichment of the genes 
inactive colitis samples versus healthy control samples was evaluated 
using the gene set enrichment analysis module (v.19.0.25) in GenePat- 
tern (https://cloud.genepattern.org/gp). Transcriptomic analysis of 
purified intestinal epithelial cells from patients with UC was performed 
using a public dataset. The read-count matrix was downloaded from 
ArrayExpress (study ID: E-MTAB-5464). We examined expression data 
of sigmoid colon epithelium from 11 patients with UC and 11 healthy 
control individuals. Exact negative binomial test was performed using 
the R Bioconductor package edgeR (v.3.24.3). 


Targeted sequencing 

We used short-term-cultured (fewer than four passages) bulk orga- 
noids and fresh epithelium tissues for targeted sequencing. DNA was 
extracted using the QlAamp Blood Mini Kit (Qiagen) and treated with 
RNase Cocktail (Ambion). The targeted genes consist of genes that 
were recurrently mutated in the UC organoid whole-exome sequencing 
data; IL-17 signalling-related genes (based on the literature); cytokine 
or PAMP receptor genes; UC risk genes identified in previous genome- 
wide association studies; and colon cancer driver genes, as listed in 
Supplementary Table 2. Cytokine and PAMP receptors were extracted 
from data from the FANTOMS project*® and genes with counts per mil- 
lion greater than 1in at least 50% of sorted UC and control epithelium 
samples” were used for sequencing. Molecular barcode-based high- 
sensitivity variant detection was performed using the HaloPlex HS 
target enrichment system (Agilent Technologies). Custom probe panels 
for Illumina sequencers were designed using the SureDesign online 
tool (Agilent Technologies). The libraries were sequenced with 150-bp 
paired reads using a MiSeq sequencer (Illumina) and MiSeq Reagent Kit 
v2 (300 cycles) (Illumina). Obtained fastq files were analysed by Agilent 
SureCall 3.5 (Agilent) using the default settings. In brief, cleaned fastq 
files were mapped onto the human reference genome version GRCh37 
(hg19) with BWA-MEM. SNVs were called using the SNPPET SNP caller. 
High-confidence regions with at least 250 total reads and 5 or more 
reads with SNVs were used for analysis. Raw variants were annotated 
by SureCall with default settings. After the SureCall analysis, SnpEff 
(v.4.3t) annotation was performed to specifically detect splicing donors 
and acceptors. Raw variants were filtered by removing variants regis- 
tered in HGVD and common variants detected in the pooled targeted 
sequencing data of four blood samples in this study. Hotspot muta- 
tions were defined as SNVs with more than 30 registered cases in the 
COSMIC database. D437, S438 and R512* mutations in ZC3H12A were 
also included as hotspot mutations. To exclude potential sequencing 
errors, we selected truncation mutations that were uniquely identified 
in our targeted sequencing study except for hotspot mutations. These 
data were also used to calculate dN/dS ratios using the R package dndscv 


(v.0.0.1.0). Mutations detected by target sequencing and the related 
data are available in Supplementary Table 2. 


Quantitative PCR with reverse transcription 

To analyse the induction of NFKB/Zand NOS2by treatment with IL-17A, 
we treated patient-derived wild-type, NFKB/Z-mutant, TRAF3/P-mutant 
and ZC3H12A°*” organoids, as well as genetically engineered NFKBIZ®, 
TRAF3IP2©, ILI7RA® and ZC3H12A°#' organoids, with 100 ng mI IL- 
17A (R&D systems) for 3 h(for NFKBIZ) and 24 h (for NOS2). In specific 
experiments, the organoids were expanded in a Noggin-deficient condi- 
tion. Total RNA was extracted from organoids using the RNeasy Mini Kit 
(Qiagen) and reverse-transcribed using the Omniscript RT Kit (Qiagen) 
according to the manufacturer’s instructions. Quantitative PCR was 
performed using FastStart Essential DNA Probes Master (Roche) and 
Universal Probe Library (UPL) probes, and fluorescence was detected 
with a LightCycler 96 device (Roche). Relative gene expression levels 
were calculated with the AAC, method using ACTB as a housekeeping 
gene. UPL probe numbers and primer sequences are available in Sup- 
plementary Table 3. 


Capillary-based immunodetection 

Patient-derived NFKB/Z-mutant (UCO2N IS), TRAF3/P2-mutant (UCO7T_ 
IT) and ZC3H12A°#” (UC23N_IS) organoids, as well as genetically engi- 
neered NFKBIZ® organoids, together with their wild-type counterparts 
(UCO2N_UA, UCO7T_IR and UC23N_UA), were treated with or without 
100 ng mI" IL-17A for 24 hand collected using Cell Recovery Solution 
(Corning). Total protein was extracted using Cell Lysis Buffer (Cell 
Signaling Technology) supplemented with Halt Protease and Phos- 
phatase Inhibitor Cocktail (Thermo Fisher Scientific). IkBZ, PIGR, IL- 
17RA and B-actin proteins were detected using a Wes capillary-based 
device (Protein Simple) according to the manufacturer’s instructions. 
Rabbit anti-IkB-¢ (9244, Cell Signaling Technology), rabbit anti-PIGR 
(HPAO12012, Sigma-Aldrich), mouse anti-IL-17RA (sc-376374, Santa 
Cruz), mouse anti-B-actin (A1978, Sigma-Aldrich) antibodies were used 
with a 1:50 dilution ratio. Signals were detected by HRP-conjugated 
secondary goat anti-rabbit (042-206, Protein Simple) and goat anti- 
mouse (042-205, Protein Simple) antibodies. 


Immunohistochemistry and immunocytochemistry 

Sections of 5-m thickness were prepared from archived paraffin- 
embedded tissues. Immunostaining of PIGR, secretory IgA, cytokeratin 
20 andiNOS was performed following standard procedures using rabbit 
anti-PIGR (HPAO12012, Sigma-Aldrich, 1:3,500), mouse anti-IgA secre- 
tory component (ab3924, Abcam, 1:1,200), mouse anti-cytokeratin 20 
(M7019, DAKO, 1:100), and rabbit anti-iNOS (ab15323, Abcam, 1:100) 
antibodies. Whole-mount immunofluorescent staining of organoids 
was performed as previously described*®. For immunostaining, rab- 
bit anti-cleaved caspase-3 (Asp175) (9661, Cell Signaling Technology, 
1:400) antibody was used, with subsequent labelling by Alexa Fluor 
568-conjugated anti-rabbit antibody (Thermo Fisher Scientific). Nuclei 
were counterstained with Hoechst 33342 (Thermo Fisher Scientific). 


CRISPR-based knockout screening 

We selected NFKIBZ, IL17RA and PIGR genes for knockout screening. 
Gene targets and guide RNA sequences are listed in Supplementary 
Table 3. For each gene, four or five sgRNA targets were designed, and 
ten negative control sequences without homologous sequences in 
the human genome were also included. The lentiGuide-Puro vector 
(Addgene, 52963) was modified to increase the knockout efficiency, as 
previously described***’”. Custom sgRNA oligonucleotides were pooled 
and cloned into the modified vector. Lentiviral particles that express 
human codon-optimized SpCas9 or pooled sgRNAs were produced 
by atransient transfection of the lentiCas9-Blast (Addgene, 52962) or 
modified lentiGuide-Puro vector. We used two normal colonic organoid 
lines that we previously established as normal counterparts of colon 
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cancer organoids*. The organoids were first infected with the Cas9- 
expressing lentivirus by spinoculation**. To retain Cas9 expression, 
10 pg mI blasticidin S (FUJIFILM Wako Pure Chemical Corporation) 
was added to the medium throughout the experiment. The organoids 
were subsequently infected with the pooled sgRNA lentivirus via spi- 
noculation at a multiplicity of infection of 0.2 and then selected with 
2g mI puromycin for two days. The infected organoids were allowed 
to expand for four weeks either in the presence or absence of 100 ng mI? 
IL-17A with weekly passaging using TrypLE Express. After the treatment, 
DNA was collected using the QlAamp Blood Mini Kit (Qiagen), andthe 
sequencing library was prepared as previously described with two steps 
of PCR amplification”. The amplicons were sequenced with MiSeq 
(Illumina) and the reads without any matching sgRNA sequence were 
filtered out. The abundance of eachsgRNA was calculated as the propor- 
tion of the sgRNA read count to the total filtered reads. The fold-change 
value for each sgRNA was defined as the ratio of the sgRNA frequency 
inIL-17A-treated organoids to that in IL-17A-untreated organoids, and 
the values were further normalized to the mean fold-change value of 
non-targeting controls. 


Genetic engineering of organoids 

Knockout of the NFKBIZ, IL17RA and TRAF3IP2 genes was performed 
using electroporation as previously described™ with slight modifica- 
tions. Guide RNA sequences are listed in Supplementary Table 3. The 
organoids were cultured with the modified culture protocol using IGF-1 
and FGF-2” throughout the experiment. Each sgRNA was co-delivered 
with the GFP-puro piggyBAC transposon vector (PB513B-1, System 
Biosciences). We also adopted a two-day hypothermic incubation at 
30 °C to facilitate genome editing*”™. Five days after electroporation, 
organoids were treated with 2 pg ml puromycin (Thermo Fisher Sci- 
entific) for two days for selection. Puromycin-resistant clones were 
manually picked up, expanded and subjected to sequence confirmation 
by bacterial cloning or sequence trace decomposition™. Loss of the 
IkBZ protein was confirmed for all knockout lines using capillary-based 
immunodetection. 

Knock-in of ZC3H12A mutations was done by homology-directed 
repair using sgRNAs and targeting vectors. The 5’ and 3’ arms that flank 
the DSGXXS motif were designed independently owing to the high 
GC content in the 3’ arm. The 5’ homology arm with a D437Y muta- 
tion (GAC>TAC) (1kb in length) was synthesized with GeneArt Strings 
DNA Fragments (Thermo Fisher Scientific). The 3’ arm (1kbin length) 
was amplified by genomic PCR. The arms were then cloned into the 
pCR2.1-TOPO vector (Thermo Fisher Scientific) using the In-Fusion 
HD cloning kit (Clonetech). sgRNAs and the targeting vector were co- 
electroporated with the GFP-puro piggyBAC transposon vector ina 
similar manner to the knockout experiment. After puromycin selec- 
tion, single organoid clones were manually isolated, expanded and 
subjected to sequencing. Owing to the low knock-in efficiency, all the 
clones witha knocked-in allele contained frameshift mutations inthe 
other allele. We therefore used a clone heterozygous for a knocked-in 
allele and an in-frame frameshift mutation for analysis. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 
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available from the corresponding author on reasonable request. 
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Extended Data Fig. 1| Study design. a, Nomenclature of organoids. b, Patient demographics and the breakdown of the organoids used for each analysis. 
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Extended Data Fig. 2| Mutational signatures of HC, UC and CAN organoids. 
a, Relative contribution of the nucleotide transition patterns in each sample 
group in healthy (n=28), UC"" (n=15), UC" without CAN (n=15) and UC™ with 
CAN (n=17) organoids. Data are shownas the mean (+s.d.) relative contribution 
of base substitution types in each sample type. b, Cosine similarity of the 
mutational signature of each organoid clone to the COSMIC signatures (1-30). 
UC" organoids with an MLHI1 mutation (UCO8T_IS1, UCO8T_IS2) are 
highlighted in red. One organoid line with a contribution of APOBEC signatures 
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(signatures 2 and 13) is highlighted in green characters (UC1OT_IS).c, Number 
of truncating mutations in the indicated genes in healthy, UC" (n= 43) and 
uc" (n=31) organoids. Genes with two or more organoid clones that have 
truncating mutations are shown. MLH1-mutated organoids (UCO8T_IS1, 2) were 
excluded from this analysis. *P= 0.004; q< 0.1 (two-sided Fisher's exact test 
with Benjamini-Hochberg adjustment). d, dN/dS ratios of NFKB/Zand PIGR 
(genes with pglobal < 0.05 and qglobal < 0.1), calculated using the whole- 
exome sequencing data. 


i ee sai alesis 


19202122 


4) 20S ANS SINT LIS) AO A442 43 4445481718 | 


UCOiT_ID 
uco2N_ID 
UCO2N_IS 
UCO3N_ID 
UCO3N_IS 
UCO4N_ID 
UCO4N_IS 


b 


UCOST_IA 
UCO5T_ID 
UCOBN_ID 
Uco6N_IS 
Uco7TIR 
uco7T_IT 
ucosT_is1 
ucosT_is2 
UCt0T_IS 
UCHITIA 
uctiTID 
UC12T IA 
uUct2T_Is 
UC13T_IA 
UCt3T_IS 


Wels 


i 
| 
= 


i 
El 


HC 


uc14TID 
UCI5N_IS 
Uc2iN_IS 


Uc22N_IS 
UC23N_IS 
UC36N_IS 
UC38N_IS 
UC5ON_IR 
UC52N_IS 


HO1_UA 
HO1"UD 
HO1_US 


5 


a 


SESESS55 


HO6_US 


x 
aici 
‘S 
G 
— 


H12_UA 


i 


Ezz 
a Po 
5S5 


5 


H20_UA 


UcosT_TS 
UctoT_TS 
UCI14T_TR 


CAN 
8 
aay 


b 
Ver(uUCcosT ID 4647181930122 
XY 


4 2 3 4 5 6 7 8 91011121045 
2 


(Am eam sees nom wp ebm 
RBFOX1 | 


Log,(CN ratio) 


TRAF3IP2 


—- ese foot Wee Hen i (> 0s Wan Hem Onn AAD HUN WH MHL HVOAIRO EY wt 
-2 


Allelic difference 


Uc (UC10T_IS 


tip2 
2 


0 es a sta oe emp, 
FHI f 


reccieipaae 2 
Y 


3°94 5 6 7 8 9 101112131415 x 


Log,(CN ratio) 


oa 
RBFOX1 


2 


0 me ru pentane noe omen Poms oes ose meme 


om mm te ce re 0 080 HEN aN I et 


2 CN-LOH 


UC (UC15N_IS reqcogngzep 2 
xy 


1 2 3 4 5 6 7 8 9 101112131415 


Allelic difference 


2 


(PRS GS sD SS GSH FB eR PRT SRE 
-2 | 


2 
om imme nest ot i ro Ht ot mt te nt 
0 10s sn aes noses ee eae See an bon me em Die ngNRN 


cm) om whee poem em punsens ebm 2 be pao omen Homa cneen me 


2 CN-LOH 


Log,(CN ratio) 


Allelic difference 


Extended Data Fig. 3 | See next page for caption. 
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Extended Data Fig. 3 | Copy-number analysis of UC organoids. a, Heat map 
showing the average minor allele frequencies (MAFs) of SNPs that were 
heterozygous in the normal counterpart (bin size 25 Mb). Genomic regions 
with LOH are indicated with red boxes. Genomic positions of PIGR, NFKBIZand 
TRAF3IP2 are shownat the top. Organoid lines that were subjected to CGH-SNP 
array analysis are highlighted in red. Organoid lines with MLH1 mutations 
(UCO8) are not shown. b, DNA copy-number analysis using CGH-SNP 
microarray analysis. For each organoid line, the genome-wide chromosome 
state is demonstrated by the log,-transformed signal ratio (top) and the allelic 
difference (bottom) values. Genomic regions with CN-LOH are shown inred. 


Deletions in common fragile sites are shown in green. Genetic lesions 
associated with UC-related mutations are shown in black. c, Loss of PIGR 
protein in UC12T_IA organoids with CN-LOH at the P/GR locus. Intact PIGR 
protein expression in UC12T_IS organoids is also shown. B-actin was used asa 
loading control. Data are representative of n=2 technical replicates with 
similar results. d, Focal deletion of fragile sites in UC organoids, detected by 
CGH analysis. Chromosome states are visualized using the log,-transformed 
signal ratio. Deletions in chromosomal fragile sites at positions 3p14 and 
16p13.2 were uniquely detected in the UC epithelium. 
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Extended Data Fig. 4 | See next page for caption. 
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Extended Data Fig. 4 | Characterization of UC-related mutations. a, 
Increased expression of NFKB/IZ mRNA in purified UC epithelium 
(E-MTAB-5464). NFKBIZ expression was compared between healthy control 
individuals (n=11) and patients with UC (n=11) using the exact negative 
binomial test inedgeR (P=5.0 x 10°). Bars represent the median. b, Gene set 
enrichment analysis using public datasets (GSE38713, left; GSE59071, right) 
andin-house IL-17 target genes (Supplementary Table 3). NES; normalized 
enrichment score. Left, n=15 (active colitis), n=13 (healthy control); right, 
n=82 (active colitis), n=11 (healthy control). c—h, IL-17A-induced expression of 
NFKBIZ(IkBZ) and PIGR in colon organoids. Organoids were stimulated with 
100 ng mI"IL-17A for 3 h(c, f) or 24h (d, e, g, h). The expression of NFKBIZ (IKBZ) 


and PIGR in control, NFKB/Z-mutant and TRAF3/P2-mutant organoids was 
analysed by quantitative PCR with reverse transcription (c, f) and immunoassay 
(d,e, g, h). Matched UC""" organoids that were derived from the same patients 
were used as controls. mRNA expression levels are shown relative to the 
expression of ACTB. Expression levels of PIGR protein relative to B-actin are 
shownat the bottom of the pseudo-blot images. Results are shown from two 
(c-e) and one (f-h) biologically independent experiments, and two technical 
replicates (g,h). Dataare shown as mean +s.e.m. (c,f).i, Genetic mutations 
detected by whole-exome sequencing in CAN organoids. Patterns of mutations 
are specified using the same symbolsas Fig. 2a. 
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Extended Data Fig. 5 | See next page for caption. 
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Extended Data Fig. 5 | Targeted sequencing of UC-related genes. a, 
Consistency of VAFs between fresh epithelium samples and organoids (n=11 
pairs). Red dots and lines show truncating and non-synonymous mutations in 
UC-mutated genes (NFKBIZ, PIGR, IL17RA, TRAF3IP2 and ZC3H12A) (n=12 
mutations). Black dots and lines show synonymous mutations in genes that are 
mutated in UC and any mutations in the other genes (n=19 mutations). VAFs did 
not significantly differ between the sample pairs (P=1, two-sided Wilcoxon 
signed-rank test). b, dN/dS ratios of the indicated genes inthe targeted 
sequencing data. Genes with with q-values <0.1 (thatis, statistically significant) 
are shown. c, Genetic mutations in UC" and control organoids, detected by 
targeted sequencing. Top, for each gene, the number of different variants is 


shown by density. Bottom, the VAF in each analysed sample. Each dot 
represents the fraction of each altered allele per total reads mapped tothe 
gene. Genes are shown in different colours as specified onthe left.d, PIGR 
(top), secretory IgA (middle) and cytokeratin20 (CK20) (bottom) 
immunostaining of UC epithelium (UCO9T_IS2 and UC17T_IA) with focal loss 
of PIGRand secretory IgA expression. Magnified areas show boundaries 
between regions in which PIGR expression is intact and regions in which PIGR 
expression is lost. Images are representative images of four regions in which 
PIGR expression is lost, in eight patients with UC. Scale bars, 500 pm (low 
magnification); 100 pm (high magnification). 
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Extended Data Fig. 6 | Targeted sequencing of UC-related genes (continued). 


a, b, Reduced response to treatment with IL-17A in patient-derived cloned 
ZC3H12A°*”" UC organoids (UC23N_IS) incomparison to wild-type 
counterpart organoids (UC23N_UA). Results are representative of n=2 
technical replicates with similar results. c, Design of CRISPR-Cas9-mediated 
ZC3H12A°*”" knock-in. The donor vector contains two silent mutations at the 
protospacer adjacent motifs flanking the CRISPR targets, in addition to the 
D437Y mutation. Eachallele of the knocked-in clone used for analysis is shown 
onthe right. The clone is heterozygous for a knocked-in D437Y mutation anda 


42-bp in-frame deletion. d, e, Reduced response to treatment with IL-17A in 
genetically engineered ZC3H12A>*”"" organoids compared to parental wild- 
type organoids. Results are representative of n=2 technical replicates with 
similar results. f, Per-patient data of the targeted sequencing result. For each 
patient, mutations in organoids that were derived from the most-distal sample 
are shown. Top, for each gene, the number of different variants is shown by 
density. Bottom, dots represent the VAF of each variant. Genes are shown in 
different colours as specified on the left. Data are shownas mean +s.e.m. (a,d). 
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per gene are shown by density (top). Top, for each gene, the number of different 


variants is shown by density. Bottom, dots represent the VAF of each variant. 
Genes are shown in different colours as specified onthe left.d, Sporadic colon 


cancer gene mutations identified in CAN organoids. 


Extended Data Fig. 7 | Targeted sequencing of cytokine and PAMP receptor 
genes and cancer driver genes. a—c, Mutationsin cytokine and PAMP receptor 
genes (a) and sporadic coloncancer driver genes (b,c) identified in UC'™, Uc!" 
and control organoids by targeted sequencing. Number of different variants 
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Extended Data Fig. 8 | See next page for caption. 


Extended Data Fig. 8 | IL-17A-induced cytotoxic response in human colon 
organoids. a, Fold-change values of individual sgRNAs in two experiments that 
used different lines of organoids. sgRNAs are coloured according to their 
targeting genes, as specified at the top. b, Response of wild-type organoids to 
treatment with IL-17A in differing growth factor conditions. The organoids 
showed acytotoxic response to treatment with IL-17A only in the absence of 
Noggin. The WNRAIF condition is the medium condition including WNT3A, 
Noggin, R-spondin-1(Rspo), A83-01, IGF-1and FGF-2, but lacking EGF. Data are 
representative of n=2 biologically independent experiments with similar 
results. c, sgRNA targets and confirmation of CRISPR-Cas9-mediated 
knockout of /LI7RA, TRAF3/P2 and NFKBIZ. For knockout of NFKBIZ, three 
NFKBIZ* lines (#1-3) were generated from different parental organoids. 
Successful knockout was confirmed by Sanger sequencing in all lines. Loss of 
IkBC protein was also confirmed by an immunoassay using an organoid 
stimulated with IL-17A for 24 h. Result is froma single experiment. d, /LI7RA® 


and TRAF3IP2 organoids are resistant to treatment with IL-17A in a Noggin- 
deficient condition. Right, dots show the relative area of organoids in each well. 
Results are representative of n=3 technical replicates. e, NFKBIZ® organoid 
line1 tolerates treatment with IL-17A. The other two knockout lines showed a 
similar response. f, Patient-derived NFKB/Z-mutant organoids (UCO2N_IS) are 
resistant to treatment with IL-17A. Wild-type counterpart organoids (UCO2N_ 
UA) were used asacontrol. Data are representative of n=2 biologically 
independent experiments with similar results. g, Patient-derived ZC3H12A°””’- 
mutant organoids (UC23N_IS) are tolerant of treatment with IL-17A compared 
to their wild-type counterparts (UC23N_UA). Data are representative of n=2 
technical replicates with similar results. h, Knockout of 7P53 mitigates IL-17A- 
induced cytotoxicity. Data are representative of n=2 biologically independent 
experiments with similar results. Mean + s.e.m (d-h). Pvalues determined by 
two-sided Welch’s t-test. Scale bars, 1mm (b, d-h). 
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Extended Data Fig. 9 |iNOS-dependent apoptosis in human colon organoids 
treated with IL-17A. a, Detection of cleaved caspase-3 by immunoassay. 
Treatment with IL-17A increases the expression of cleaved caspase-3 in wild- 
type, but not NFKBIZ®, organoids in the absence of Noggin. b, Analysis of NOS2 
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transcript expression by quantitative PCR. IL-17A induces expression of NOS2 
transcript in wild-type, but not in NFKBIZ®, TRAF3IP2" and IL17RA®, 
organoids inthe absence of Noggin. 
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Extended Data Table 1| Clinical information on the participants in the study 


ID Disease Age Sex uc extent Sample Counterpart "ap Disease poogex _-UCextent == Sample Counterpart 
duration or patient status collection Normal duration or patient status collection Normal 
UCO1T 25 62. F Extensive Surgery Blood UC37N 5 29 M Extensive Biopsy None 
UCO2N 13 54 F Left sided Biopsy Blood UC38N 27 58 M Left sided Biopsy Blood 
UCO3N 6 61 F Left sided Biopsy Blood UC39T 17 35 M Extensive Biopsy None 
UCO4N 32 45 M Left sided Biopsy Blood UC40T 18 34 =F Extensive Biopsy None 
UCOST 12 48 F Extensive Biopsy Blood UC41T 1 68 M Extensive Biopsy None 
UCO6N 27 43 M Left sided Biopsy Blood UC42T 10 70 M Left sided Biopsy None 
UCO7T 12 37 =F Extensive Biopsy Blood UC43T 18 50 M Extensive Biopsy None 
UCO8T 9 62 M Extensive Surgery Blood UC44T 1 68 M Extensive Surgery None 
UCOST 12 43 OF Left sided Surgery Blood UC45T 22 47 M Left sided Surgery None 
UC10T 40 62 M Extensive Surgery Blood UC46N 22 tL FE Left sided Biopsy None 
UC11T 35 76 =F Extensive Surgery Blood UC48N 25 53 M Extensive Biopsy None 
UC12T 16 61 M Extensive Surgery Blood UC49N 9 60 M Extensive Biopsy None 
UC13T 16 30 M Extensive Surgery Blood UC50N 22 50 M Proctitis Biopsy Blood 
UC14T 10 74 M Left sided Surgery Blood UCSIN 15 50 M Extensive Biopsy None 
UC15N 6 38 =F Left sided Biopsy Blood UC52N 15 34 =F Left sided Biopsy Blood 
UC16T 31 67 F Extensive Surgery None HO1 NA 57 M Healthy Biopsy Blood 
UC17T 12 51 =F Extensive Surgery None HO2 NA 48 M Healthy Biopsy Blood 
UC18T 19 37 M Extensive Biopsy None HO3 NA 44 F Healthy Biopsy Blood 
UC19T 16 52 M Extensive Surgery None H04 NA 50 M Healthy Biopsy Blood 
UC20T 14 55. \F Extensive Surgery None HO5 NA 65 F Healthy Biopsy Blood 
UC21N 12 ac -F Left sided Biopsy Blood HO6 NA 75 M Healthy Biopsy Blood 
UC22N 24 48 M Left sided Biopsy Blood H11 NA 67 M Healthy Biopsy Blood 
UC23N 18 36 6F Left sided Biopsy Blood H12 NA 58 M Healthy Biopsy Blood 
UC24N 23 46 F Extensive Biopsy None H13 NA 57 FF Healthy Biopsy Blood 
UC25N 6 23 M Extensive Biopsy None H14 NA 77 M Healthy Biopsy Blood 
UC26N 15 47 F Left sided Biopsy None H15 NA 70 F Healthy Biopsy None 
UC27N 9 35 M Extensive Biopsy None H16 NA 54 =F Healthy Biopsy None 
UC28N 2 39 =F Extensive Biopsy None H17 NA 70 F Healthy Biopsy None 
UC29N 3 56 F_ Right sided +proctitis Biopsy None H18 NA 49 F Healthy Biopsy None 
UC30N 10 38 F Right sided +proctitis Biopsy None H19 NA 32 M Healthy Biopsy Blood 
UC31N 41 67 M Extensive Biopsy None H20 NA 22 F Healthy Biopsy Blood 
UC32N 6 56 M Left sided Biopsy None CACO2 28 45 M Extensive Surgery Non-tumor 
UC33N 14 65 M Extensive Biopsy None CAC04 30 45 M Left sided Surgery Non-tumor 
UC34N 8 75 M Left sided Biopsy None CACO6 39 54 F Extensive Surgery Blood 
UC35N 4 20 M Proctitis Biopsy None CAC08 11 47 M Extensive Surgery Blood 
UC36N 16 29 M Left sided Biopsy Blood 


Patient IDs and other related information in our cohort. The IDs are specified as described in Extended Data Fig. 1a. 


natureresearch ee aan 


Last updated by author(s): Oct 21, 2019 


Reporting Summary 


Nature Research wishes to improve the reproducibility of the work that we publish. This form provides structure for consistency and transparency 
in reporting. For further information on Nature Research policies, see Authors & Referees and the Editorial Policy Checklist. 


Statistics 
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n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


[| A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 
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Give P values as exact values whenever suitable. 
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Software and code 


Policy information about availability of computer code 


Data collection No software was used for collecting data. 


Data analysis For whole exome sequencing analysis, raw fastq reads were mapped onto the human reference genome GRCh37 (hg19) with BWA (ver 
0.7.8). A GATK (ver 1.6.13) toolkit UnifiedGenotyper was used for SNV and indel calling. SnpEff (ver 3) was used for the annotation of 
SNPs in dbSNP build135 and the 1000 genome project database. R (ver 3.5.1) was used for HGVD (release ver 2.3.0) annotation, filtering 
and counting the number of mutations. Ime (R package nlme, ver 3.1) was used for apply a linear mixed model to estimate the effect of 
age, disease duration and disease groups on the number of SNVs. MutationalPatterns (ver 1.8.0, R package) was used for mutation 
signature analysis. Fisher's exact test was done using the fisher.test function (R package stat, ver 3.5.1) . dndscv (ver 0.0.1.0, R package) 
was used for dN/dS analysis. 

For targeted sequencing analysis, SureCall (ver 3.5, Agilent) was used for mapping (BWA-MEM), for SNV and indel calling (SNPPET SNP 

Caller) and for annotation of SNPs in dbSNP and the 1000 genome project database. SnpEff (ver 4.3t) was also used for EFFECT 

annotation to specifically detect splicing donors and accepters. R (ver 3.5.1) was used for HGVD (release ver 2.3.0) annotation, filtering 

and counting the number of mutations. dndscv (ver 0.0.1.0, R package) was used for dN/dS analysis. 

For copy number analysis, apt-copynumber-cyto-ssa (Affymetrix Power Tools, ver 1.19.0) was used for obtaining probe-level copy 

number and allele difference. segmentCGH (R package rCGH, ver 1.8.1) was used for segmentation analysis. 

For gene expression analysis, rma (R package affy, ver 1.56.0) was used for normalization. Gene Set Enrichment Analysis module (ver 

3.24.3, GenePattern) was used for GSEA. Differentially expressed genes were determined using limma (ver 3.40.2, R package). Public 

RNA-seq data was analyzed using edgeR (ver 3.24.3, R package). 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 
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Data 
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All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


Gene expression datasets were deposited in Gene Expression Ominibus. Accession code of GEO is GSE127757. 

The secure token of GEO is as followed :ijoxsgmafxizbun. 

Whole exome datas were deposited in JGA database. Accession number is JGASOO000000199. 

All relevant data that are included with this study are available from corresponding author upon reasonable request. 
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Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size No formal statistical methods were used to predetermine sample size. Instead, we enrolled all individuals who consent to participate our 
study during the enrollment period between 1/Aug/2014 and 31/Mar/2019. We collected 44 and 144 colorectal samples from 16 HCs and 55 
UC patients, respectively. The sample size of our study was sufficient to assess our sequencing data and to confirm novel findings with 
statistical significance. 


Data exclusions All the samples which we could established organoids were used for analysis. UC organoids with a MLH1 mutation were excluded from the 
mutation number analysis. 


Replication All attempts at replication were successful. 


Randomization — Not applicable for this study because this study is a case-series study . 


Blinding The experiments and analysis used did not require blinding 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Antibodies 


Antibodies used Primary antibodies used in this study were as follows: rabbit anti-cleaved-caspase3 (Asp175) (9661, Lot: 45, Cell Signaling 
Technology, 1:400), rabbit anti-PIGR (HPAO12012, Lot: c106465, Sigma-Aldrich, for IHC 1:3500, for capillaraly-based 
immunodetection 1:50), rabbit anti-IKBZ (9244, Lot: 2, Cell Signaling Technology, 1:50), mouse anti-B-Actin (AC-15) (A1978, 
Lot:037M4782V, Sigma-Aldrich, 1:50), mouse anti-lgA secretory component (ab3924, Lot: GR3236730-3, Abcam, 1:12000), 
rabbit anti-iNOS (ab15323, Lot: GR3272520-2, Abcam, 1:100), mouse anti-cytokeratin 20 (Ks20.8) (M7019, Lot: 20061091, Dako, 
1:100), mouse anti-IL17R (G-9) (sc-376374, Lot: K2817, Santa Cruz, 1:50). 

Secondary antibody used in this study was as follows: donkey polyclonal anti-rabbit IgG (H+L), Alexa Fluor 568 (A10042, Thermo 
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Fisher Scientific, 1:500), goat anti-rabbit secondary HRP conjugate (042-206, Protein Simple), goat anti-mouse secondary HRP 
conjugate (042-205, Protein Simple) 


Validation All antibodies used were validated by the respective vendors for the applications used in this study. The specificity of anti-PIGR, 
anti-IkBz and anti-igA secretory component antibodies were verified by organoid knockout experiments. 


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) All organoids were established from the tissues obtained from patients at Keio University Hospital or Tokyo University 
Hospital. 

Authentication None of the organoids were authenticated 

Mycoplasma contamination The organoids were not tested for mycoplasma contamination. 
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Commonly misidentified lines None of the used cell lines are listed in the ICLAC database of commonly misidentified cell lines. 
(See ICLAC register) 


Human research participants 


Policy information about studies involving human research participants 


Population characteristics Participants were recruited through Keio University Hospital and Tokyo University Hospital. We enrolled a total of 16 (8 males 
and 8 females) HCs and 55 (33 males and 22 females) UC patients who underwent surgery and/or therapeutic or diagnostic 
endoscopy for lower gastrointestinal symptoms. 20-75 year-old UC patients and 22-77 year-old HCs were enrolled. Among these 
patients with UC, 26 had concomitant CAN and the remaining 29 were without CAN. All subjects and samples were anonymized 
with unique IDs before analysis. The detailed characteristics of cases are shown in Supplementary Table 3. 


Recruitment The individuals who underwent colonoscopy or surgery for gastrointestinal disease or symptoms were randomly recruited with 
written informed consent. 


Ethics oversight This study was approved by the ethics committees of Keio University and Tokyo University. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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Chronic inflammation is accompanied by recurring cycles of tissue destruction and 
repair and is associated with an increased risk of cancer’ °. However, how such cycles 
affect the clonal composition of tissues, particularly in terms of cancer development, 
remains unknown. Here we show that in patients with ulcerative colitis, the inflamed 
intestine undergoes widespread remodelling by pervasive clones, many of which are 


positively selected by acquiring mutations that commonly involve the NFKBIZ, 
TRAF3IP2, ZC3H12A, PIGR and HNRNPF genes and are implicated in the 
downregulation of IL-17 and other pro-inflammatory signals. Mutational profiles vary 
substantially between colitis-associated cancer and non-dysplastic tissues in 
ulcerative colitis, which indicates that there are distinct mechanisms of positive 
selection in both tissues. In particular, mutations in NFKB/Z are highly prevalent in the 
epithelium of patients with ulcerative colitis but rarely found in both sporadic and 
colitis-associated cancer, indicating that NFKB/Z-mutant cells are selected against 
during colorectal carcinogenesis. In further support of this negative selection, we 
found that tumour formation was significantly attenuated in Nfkbiz-mutant mice and 
cell competition was compromised by disruption of NFKB/Zin human colorectal 
cancer cells. Our results highlight common and discrete mechanisms of clonal 
selection in inflammatory tissues, which reveal unexpected cancer vulnerabilities that 
could potentially be exploited for therapeutics in colorectal cancer. 


Chronic inflammation is a major cause of morbidity and mortality in 
the human population and posesa substantial cancer risk’”. Acommon 
feature of chronic inflammation is recurring tissue injury and repair, 
which is fuelled by long-lasting, unregulated immunity and often cul- 
minates in tissue remodelling and debilitating organ dysfunction’. 
Previous studies have shown that extensive tissue remodelling can 
take place in an age-dependent manner even in tissues that are appar- 
ently normal, and that this is mediated by the expansion of numerous 
clones that carry commoncancer-related mutations* °. The expansion 
of somatically mutated clones may also have a role in inflammation- 
associated remodelling of tissues, explaining the cancer risk that is 
associated with inflammation. However, it is largely unknown whether 
this actually happens—andi if so, the frequency and extent of the clonal 
expansion, the mutations that are involved, and how it modifies the 
inflammatory disease process and the development of cancer. 


Ulcerative colitis (UC) isa common form of inflammatory bowel 
disease that affects around 2.1 million people in Europe alone’. It 
is characterized by persistent inflammation of the large intestine, 
which causes severe destruction of the intestinal mucosa and the 
formation of intractable ulcers. Although the pathophysiology of UC 
is not fully understood, deregulated immune cells and cytokines—as 
well as intestinal dysbiosis—have been implicated in the persistent 
inflammation®, which is associated with an increased risk (15-20%) 
of cancer development (colitis-associated colorectal cancer; CAC)’. 
In this study, to understand how chronic inflammation shapes tis- 
sue remodelling and how it correlates with cancer development, 
we investigated clonal expansion in epithelia from patients with 
UC and from control individuals not affected by UC, using intensive 
multi-regional sampling followed by unbiased detection of somatic 
mutations. 


A list of affiliations appears at the end of the paper. 
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Fig. 1| Somatic mutations in single crypts. a, Correlation between the number 
of mutations in single crypts (derived from WES) and the age of the individuals 
from whom the crypts were derived. Two-sided Mann-Whitney Utest for 
comparison between UC and non-UC samples. A regression line that assumes 
an intercept of zero for non-UC samples is shown with R? and coefficient values 
(blue). The R? value (red) of the linear regression model for UC samples is also 
shown (Methods). b, Average number of mutations in paired single crypts from 


Clonal history of normal and UC crypts 


The colorectal epithelium is composed of single-layer columnar cells 
that are compacted into numerous small replication units known as 
crypts. Each crypt comprises around 2,000 cells, which are replen- 
ished by the stem cells located at the bottom of the crypt’. During 
life, expansion of the colorectal epithelium and repair of physiological 
or pathological deficits of crypts are accomplished by crypt fission”. 
Thus, to understand the dynamics of UC and non-UC epithelia, we 
first evaluated somatic mutations in single crypts that were isolated 
from samples from patients with UC and unaffected individuals, using 
whole-exome sequencing (WES) of whole-genome-amplified single- 
crypt DNA (Extended Data Figs. 1-3). We found that the number of 
mutations in non-UC crypts increases with age at an annual mutation 
rate of 0.61 mutations per exome per year, and that this rate is sig- 
nificantly increased in crypts derived from patients with UC (Fig. 1a). 
When corrected for age at diagnosis and disease duration (Methods), 
the mutation rate in UC crypts was more than three times higher than 
that of non-UC crypts (1.9 mutations per exome per year). Whereas 
the mutation rate did not differ between the right and left side of the 
colon in control individuals, we observed that mutation rates were 
higher for the crypts from the left side of the colon in patients with UC, 
in which inflammation tends to be more severe than in the right side® 
(Fig. 1b). Although we found no significantly mutated genes among 
crypts from normal epithelia, three genes (NFKBIZ, PIGR and ARIDIA) 
were significantly affected by non-synonymous mutations in UC 
epithelium (ratio of non-synonymous to synonymous substitutions 
(dN/dS) > 1.0; q< 0.05), suggesting the driver roles of these genes. 
Next, we analysed the history of crypt expansion in normal and 
inflamed epithelia by isolating a cluster of crypts from a small area of 
colorectal epithelium (0.25-4 mm”) and analysing them by WES, retain- 
ing their geographical information (Extended Data Fig. 2b). We analysed 
crypt clusters from three patients with UC (aged 38-56 years) and three 
unaffected individuals (aged 78-88 years). As seen in isolated crypts, 
UC crypt clusters had substantially higher mutation rates (2.86-15.5 
mutations per exome per year) compared with non-UC crypt clusters 
(0.63-0.75 mutations per exome per year). Most of the crypts (99.4%) 
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the right and left side of the colon in each patient with UC or control individual. 
Two-sided paired t-test. c, Clonal structure within adjacent crypts in epithelia 
from acontrol individual (CR44) anda patient with UC (HCMO9). The 
corresponding clones in the phylogenetic trees and geographical maps are 
depicted by numbers and colours. The estimated time of UC onset, the age of 
the most-recent common ancestral crypt in HCMO9 and branches that contain 
driver mutations are indicated (see Methods). 


in both patients with UC and unaffected individuals shared one or more 
mutations with other crypts. On the basis of this finding, we constructed 
phylogenetic trees and superimposed them on the geographical maps 
of the crypts (Fig. 1c, Extended Data Fig. 4a—d). In non-UC epithelium, 
the majority (99/143, 69.2%) of the branch points in the phylogenetic 
trees were estimated to occur before 20 years of age (corresponding to 
12.2 mutations) (Extended Data Fig. 4e). This suggests that the number 
of crypts steadily increases until the age of 20 years to expand the size 
of the intestine (with an annual fission rate of around 0.136), whereas 
in the steady state that occurs throughout the rest of life, crypt fissions 
occur only occasionally to balance the physiological deficits of crypts 
(0.00413 fissions per year) (Methods). By contrast, the crypts from 
patients with UC had very different phylogenetic structures (Fig. 1c, 
Extended Data Fig. 4c, d). The phylogenetic trees were characterized 
by a notably long trunk containing known driver mutations, which 
terminated in many short branches. This indicates that after repetitive 
destruction and regeneration, these UC crypts rapidly expanded from 
a positively selected common ancestor crypt very recently (during the 
past 1.7-4.7 years; 3.3 years on average). 


Positive selection in the UC epithelium 

To further investigate clonal expansion in the UC epithelium, we per- 
formed WES of bulk crypts isolated from 269 non-dysplastic samples 
that were collected from 22 patients with UC and 23 control individuals, 
with a median of 6 (range, 4-8) samples per individual (Extended Data 
Fig. 1b). We also analysed an additional 294 bulk-crypt samples, which 
were collected from surgically obtained rectums from 6 patients with 
UC at regular intervals (Extended Data Fig. 1b). Compared with non-UC 
samples, UC samples had much higher numbers of mutations (51.2 
versus 1.9 mutations per sample on average) (Extended Data Fig. 5a). 
Maximum mutant-cell fractions (MCFs) were significantly higher for UC 
samples (mean = 0.58 (range, O-1)) than non-UC samples (mean =0.21 
(range, O-1)) (P< 0.01) (Methods), with 22.6% of UC samples showing 
a maximum MCF of 1.0. The number of mutations and the maximum 
MCF in UC samples weakly but significantly correlated with the interval 
between diagnosis and sampling (Extended Data Fig. 5b, c). Samples 
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Fig. 2| Driver genes and clonal expansion in the UC epithelium. 

a, Distribution of ZC3H12A mutations in non-dysplastic UC samples (58 isolated 
single crypts and 399 bulk crypts; top) and CAC samples (n= 99; bottom). 

b, Landscape of driver mutations that were detected in clones (n=183) in 
non-dysplastic samples from patients with UC. The frequencies of each 
mutation are indicated onthe right. Genes with asterisks are driversin 
non-dysplastic samples from patients with UC. LOH, loss of heterozygosity. 

c, Mutations inIL-17 and other pro-inflammatory signalling pathways in non- 
dysplastic samples from patients with UC. Mutated components are indicated 
by coloured boxes. d, Expansion of clones within the rectum from two 
representative patients with UC (out of six). Histology is shown by the colour of 


from the left side of the colon tended to show higher numbers of 
mutations and higher MCFs compared with those from the right side 
(Extended Data Fig. 5d, e). 

To decipher the mechanism of positive selection in the UC epithe- 
lium, we analysed the dN/dS ratios for 30,062 mutations that were 
detected in399 non-dysplastic samples that were derived from patients 
with UC, and identified 14 driver genes for which there was evidence of 
positive selection (dN/dS > 1.0, q< 0.05) (Fig. 2a, b, Extended Data Fig. 6) 
(Methods). By contrast, no positively selected mutations were detected 
in179 samples from control individuals. Most of these driver gene muta- 
tions inthe UC epithelium were predicted to cause protein truncation 
and loss of function. Exceptions were mutations in ZC3H12A, KRAS and 
HNRNPF, which exhibited prominent mutational hotspots indicative of 
some neomorphic functions (Fig. 2a, Extended Data Fig. 6a). NFKBIZ, 
TRAF3IP2 and RNF43 frequently underwent biallelicinactivation, which 
was less common in other drivers (Fig. 2b). 

Asingle bulk-crypt sample may contain multiple independent clones 
and subclones that carry discrete driver mutations. Thus, to precisely 
determine the frequency of mutations in the UC epithelium on an inde- 
pendent clone basis, we first clustered mutations that were detected in 
bulk-crypt samples using PyClone and—on the basis of the predicted 
cell fraction for each cluster of mutations—identified a total of 183 
independent clones according to the ‘pigeonhole principle’ (Extended 
Data Fig. 1d, Methods). Among these, 166 (90%) contained one or more 
driver mutations, which most frequently affected NFKBIZ, followed 
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in each rectum derived from patients with UC. 


by ARIDIA, PIGR, KRAS and ZC3H12A (Fig. 2b). Notably, the products 
of the genes that were mutated in the UC epithelium overlapped with 
signalling pathways that converge on NFKBIZ (also known as IkBC) and 
related molecules (ZC3H12A and PIGR) (Fig. 2c), with the mutations in 
these signalling-related molecules occurring in a mutually exclusive 
manner (Fig. 2b). Most prominent among these mutational targets are 
molecules that are involved in IL-17 signalling—including IL-17 receptors 
(ILI7RA and IL17RC) and the adaptor of IL-17 receptors, TRAF3IP2, which 
were shown to be positively selected in the UC epithelium (Methods)— 
suggesting a major role of IL-17 signalling in this positive selection 
(Fig. 2c). Moreover, other inflammation-related cytokine receptors 
and pattern recognition receptors (PRRs) that are implicated in the 
upregulation of NFKBIZ (including OSMR, LTBR, IL2ORA and TLR9”; 
Extended Data Fig. 6b) also underwent loss-of-function mutations in 
occasional clones (Methods). 

We assessed the expansion of clones carrying mutations in these 
cytokine and PRR pathway components in surgically obtained rectums 
from six patients with UC using densely collected samples (Fig. 2d, 
Extended Data Fig. 7a). We identified 41 independent driver-mutated 
clones that involved two or more samples in these specimens, of which 
31 had mutations in the cytokine and PRR pathways. Notably, in these 
patients, 53-83% of the entire rectum was made up of clones with one or 
more driver mutations, most of which were explained by mutations in 
the cytokine and PRR pathways (Fig. 2e). The clonal expansion could be 
extensive, occupying an epithelial area as large as 19 cm? (Extended Data 
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Fig. 3 | Effect of driver mutations. a, Fold change in expression levels of Nfkbiz 
and Pigr (mean +s.d.) in colon epithelial organoids that were established from 
Nfkbiz™ (control) and Nfkbiz"Vill-cre (Nfkbiz cKO) mice at the indicated time 
points after stimulation with IL-17A. Asterisks indicate P< 0.05 (two-sided 
Mann-Whitney Utest). b, Left, immunoblot (IB) analysis of HCT116 cells that 
express empty vector, Myc-tagged wild-type (WT) ZC3H12A or the indicated 
ZC3H12A mutants after stimulation with IL-17A for the indicated times, using 
anti-Myc and B-actin antibodies. Data are representative of two independent 
experiments. Right, the ratio of protein to mRNA for ZC3H12A (n=3 biological 
replicates) (mean +s.d.). Asterisk indicates P< 0.05 between mutant and 
wild-type ZC3H12A (two-sided Student’s f-test). For gel source data, see 
Supplementary Fig. 1.c, Fold change (mean +s.d.) inthe ratio of Nfkbiz” 
(knockout) to Nfkbiz" (flox) alleles at the indicated number of passagesin 
NfkbizcKO and control organoids co-cultured in the presence or absence of IL- 
17A. Asterisks indicate P< 0.05 (two-sided Mann-Whitney Utest). d, Density of 
LacZ-positive areas having indicated sizes in control and Nfkbiz cKO mice. 
Asterisks indicate P< 0.05 (two-sided Mann-Whitney Utest). 


Fig. 7b). In patient HCM22, almost all of the rectal epithelium (around 
300,000 crypts) was replaced by only two clones that carry mutations 
in NFKBIZ (frameshift mutation at S339 (S339fs)) and ZC3H1Z2A (S438L). 
In an analysis of phylogenetic structure, 37 clones had one or more 
driver mutations in their main trunk, and 26 of these 37 clones had 
mutations in the IL-17 pathway or other cytokine or PRR pathways— 
suggesting that these mutations were acquired before or early in the 
development of UC (Fig. 2d, Extended Data Fig. 7). 


Effect of NFKBIZ and other mutations 


Among these genes that are involved in cytokine and PRR signalling, 
those that were most commonly mutated were NFKBIZ (30%), PIGR 
(21%) and ZC3HI12A (14%). NFKBIZ—which is upregulated by anumber 
of primary stimuli, suchas IL-17A, IL-1f and lipopolysaccharide (LPS)—is 
induced by activated NFKB to regulate the expression of secondary 
response genes that include those encoding many pro-inflammatory 
cytokines” (Fig. 2c, Extended Data Fig. 6b). PIGR, which encodes a key 
molecule for the transcytosis of immunoglobulin A (IgA), is onesuch 
gene, and is known to be downregulated in the colorectal epithelium 
inIL-17R-deficient mice® and in patients with inflammatory bowel dis- 
ease’®. Thus, to understand the functional link between NFKB/Zand PIGR 
with regard toIL-17 signalling, we established Nfkbiz and Nfkbiz*"* orga- 
noids from colorectal epithelium from Nfkbiz“Vill-cre and Nfkbiz™ 
mice and examined the expression of Pigr after stimulation with IL-17A 
(Fig. 3a). In Nfkbiz‘* organoids, Pigr expression was induced by IL-17A 
slightly after the upregulation of Nfkbiz and increased until 24 h after 
stimulation. Conversely, in Nfkbiz’ organoids, the induction of Pigr 
was significantly attenuated with no Nfkbiz induction (Fig. 3a). These 
observations confirm that NFKBIZis a key regulator of Pigr expression 
upon IL-17A stimulation. 

ZC3H12A encodes an RNase known as Regnase-1, which is impli- 
cated in the degradation of the mRNA of NFKB/Z and many down- 
stream secondary response genes, including PIGR™"®. Mutations in 


the UC epithelium were highly enriched in the DSGxxS motif and the 
C-terminal part, and showed a prominent mutational peak at the S438 
residue (Fig. 2a). This residue has been previously shown to be phos- 
phorylated by the IkB kinase (IKK) complex to promote ubiquitina- 
tion and subsequent degradation of the protein”. In mice, a mutation 
at the corresponding site, S435A, is reported to prevent ubiquitin- 
mediated degradation and lead to increased expression of ZC3H12A 
after stimulation with IL-1B. We also demonstrated that cells that 
express ZC3H12A5*™ showed a higher level of protein expression 
compared with wild-type transfected cells (Fig. 3b, Extended Data 
Fig. 6c). Of note, we found a UC sample that contained a nonsense 
mutation affecting BTRC, which encodes the E3 ligase that is implicated 
in the ubiquitination of ZC3H12A. Moreover, we demonstrated that 
expression of a dominant-negative form of BTRC (BTRC(AF)) stabi- 
lized ZC3H12A when stimulated with IL-17A or IL-1B” (Extended Data 
Fig. 6d)—providing further support for the role of DSGxxS mutants in 
positive selection. Anincrease in protein expression compared to the 
wild type was also demonstrated for a C-terminal-truncating ZC3H12A 
mutant (W543*). Notably, the W543* mutant showed a much higher level 
of protein expression than the S438L mutant even though their levels 
of mRNA were similar (Fig. 3b), which suggests that the C-terminally 
deleted protein has a higher stability than the wild type regardless 
of IL-17A stimulation. Another gene that is implicated in the mRNA 
degradation of pro-inflammatory cytokine genes”’, HNRNPF, showed 
mutational hotspots that affect highly conserved tyrosine-containing 
residues (amino acids 201-210) (Extended Data Fig. 6a), although its 
link to the IL-17 signalling pathway is unknown. 

We next examined the role of IL-17 signalling in the positive selection 
of NFKBIZ-deficient cells in the UC epithelium. When an equal num- 
ber of colon organoids from Nfkbiz’ and Nfkbiz™ mice were mixed 
and co-cultured with or without IL-17A, we observed an increasing 
allelic ratio of Nfkbiz relative to Nfkbiz" in the presence of IL-17A. By 
contrast, inthe culture without IL-17A, the ratio was decreased, which 
suggests that in the presence of IL-17A, Nfkbiz-null cells have a selective 
advantage over cells in which the expression of Nfkbizis intact (Fig. 3c). 
We also investigated the driver role of NFKBI/Z in UC in vivo by gen- 
erating R26°°"Nfkbiz“'Vill-cre® and R26°°2™Nfkbiz"™’ Vill-cre®® 
mice in which colitis was induced with dextran sulfate sodium (DSS); 
aberrant IL-17 signalling has previously been implicated in this DSS 
model of colitis’. In R26°°""Nfkbiz Vill-cre®* mice, cells in which 
Nfkbizwas deleted were randomly introduced in the colorectal epithe- 
lium and visualized by LacZ staining. After treatment with tamoxifen 
(to induce Cre-mediated recombination), we induced chronic colitis 
in the mice by oral administration of DSS, and compared the size of 
the LacZ-positive area between Nfkbiz’ and Nfkbiz‘* mice 11-12 weeks 
after tamoxifen was administered (Extended Data Fig. 8). Overall, the 
total fraction of the LacZ-positive areas, as well as the severity of inflam- 
mation (as evaluated by body weight, colon length and histology), did 
not significantly differ between these mice (Extended Data Fig. 8f-h). 
However, the number of LacZ-positive areas of size 256-16,384 im? 
was significantly increased in Nfkbiz’ mice compared with Nfkbiz* 
mice (Fig. 3d), which was not observed without treatment with DSS 
(Extended Data Fig. 8d). RNA sequencing of DSS-treated colorectal 
epithelium revealed downregulation of IL-17 signalling pathway genes 
and downstream targets of NFKBIZ in Nfkbiz" mice compared with 
Nfkbiz‘’* mice (Extended Data Fig. 8i), which further supports the role of 
NFKB/Z mutations in the positive selection of cells inthe UC epithelium. 


Negative role of NFKBIZ mutations in CAC 


We next investigated driver mutations and copy-number alterations 
(CNAs) in99 CAC samples from patients with UC—including 65 samples 
for which WES data were publicly available” *° and 34 that were newly 
analysed with WES—together with 356 cases of sporadic colorectal 
cancer (SCRC) from The Cancer Genome Atlas (TCGA) (Fig. 4, Extended 
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Fig. 4| Comparison of driver mutations between UC non-dysplasia and 
cancer. a, Frequency of driver mutations among clones identified in non- 
dysplastic (ND) samples from patients with UC, as well as CAC and sCRC 
samples. Asterisks indicate q< 0.1 (two-sided Fisher’s exact test with 
Benjamini-Hochberg adjustment). b, Observed frequency of mutation of the 
indicated genes in CAC samples fromaand the 95% Cl of the frequency of 
mutation expected from the observed fraction in 432 non-cancer samples from 
patients with UC. Asterisks indicate q < 0.05 (two-sided binomial test with 
Benjamini-Hochberg adjustment). c, Box plots of normalized fold change in 
the frequency of indicated sgRNA sequence reads after four weeks of culture 
compared with the pre-culture value in five human CRC celllines (n=6 
biological replicates). Significant differences between control and NFKBIZ- 
directed sgRNAsare provided (two-sided Mann-Whitney Utest). 

d, Representative images of AOM-DSS-induced colorectal tumours in control 
(3 out of 25; top) and Nfkbiz cKO (3 out of 25; bottom) mice. e, Box plots of sum 
of tumour size (that is, the sum of the maximum diameter of all tumours) per 
mouse. Two-sided Mann-Whitney Utest. In all box plots, the median, first and 
third quartiles are indicated and whiskers extend to the furthest value within 
1.5 x the interquartile range (IQR). 


Data Figs. 1c, 9). We also performed WES in 13 samples with low- and 
high-grade dysplasia from patients with UC. 

Dysplastic and CAC samples from patients with UC had significantly 
higher numbers of mutations compared with non-dysplastic samples 
from patients with UC (Extended Data Fig. 9c, d). An analysis of muta- 
tional signatures in UC samples identified two predominant signatures, 
signatures A and B (Methods) (Extended Data Fig. 9c, e). Corresponding 
to COSMIC signature 1, signature A mostly consisted of C>T substi- 
tutions in the CpG context and was predominant across in all tissue 
types. Signature B, which corresponds to COSMIC signature 17, was 
characterized by T>G and T>C substitutions in a C[T]T context and 
was highly enriched in UC samples (Extended Data Fig. 9f), implying 
that it is linked to chronic inflammation. 

The profile of driver mutations was substantially different between 
non-dysplastic UC samples and cancer samples. dN/dS analysis iden- 
tified six genes that were significantly positively selected in CAC 
(Extended Data Fig. 9a). These genes were also commonly mutated in 
sCRC (Fig. 4a) but were rarely affected in non-dysplastic UC samples, 
in which mutations in NFKBIZ, ARIDIA, ZC3H12A, PIGR and ETV6 were 
significantly enriched. Copy-number profiles were also very different 
between CAC or dysplastic samples, and non-dysplastic samples. CNAs 
were obligatory and extensive in CAC, but much less common (41%) in 
non-dysplastic samples (Extended Data Fig. 9a, g, h). 

Although highly prevalent in non-dysplastic UC samples, NFKBIZ 
mutations were found in none of the 99 CAC samples. Thus, CAC do 
not develop from NFKB/Z-mutated cells, which suggests that NFKBIZ- 
mutated clones that were positively selected in UC epithelium havea 
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strong relative disadvantage in CAC (P= 7.7 x 10%) (Methods) (Fig. 4b). 
Similarly, NFKB/Z mutations were rarely seen in cases of SCRC fromthe 
TCGA (2/356) and in a cohort of Japanese patients with sCRC (6/525); 
among all cases from both of these cohorts, only one had atruncating 
mutation (Fig. 4a, Extended Data Fig. 9b)—also supporting the negative 
selection of NFKB/Z-mutated cells. To test this hypothesis, we trans- 
duced a lentiviral library consisting of six NFKB/Z-targeting single-guide 
(sg)RNA constructs and four non-specific (control) sgRNA constructs 
for CRISPR-Cas9-mediated insertions or deletions into five human 
colorectal cancer (CRC) cell lines, and assessed the enrichment of cells 
that were targeted by each sgRNA construct during four weeks of cell 
culture (Extended Data Fig. 10a). We observed a significant reduction 
in cell populations that were transduced with NFKBIZ-directed sgRNAS 
compared with cells that were transduced with control sgRNA in all CRC 
cell lines (Fig. 4c). Moreover, the number and size of colorectal tumours 
that were induced by inflammation using DSS and azoxymethane (AOM) 
were significantly decreased in Nfkbiz“'Vill-cre mice compared with 
control Nfkbiz™ mice (Fig. 4d, e, Extended Data Fig. 1Ob-f). These 
results of functional analyses again corroborate the negative selection 
of NFKB/Z-mutated CRC cells. A significant reduction in the frequency 
of ZC3H12A mutations in CAC compared to UC epithelium was also 
observed, in sharp contrast to 7P53 and KRAS mutations, which were 
substantially enriched in CAC (Fig. 4b). 


Discussion 


Inthe steady state of the intestine that occurs in adult life, crypts inthe 
colon still divide to balance any physiological deficits of crypts. Assum- 
ing a constant number of crypts in the adult colon, 50,000 crypts are 
estimated to be lost annually and replaced by new ones through crypt 
fissions—equivalent to around 0.4% of the total colorectal crypts per 
year. In patients with UC, the rate of crypt fission is markedly accel- 
erated (around 2.16 fissions per crypt per year) to correct the larger 
deficits of crypts; strong positive selection operates to bring about 
the dominance of clones that carry mutations in the IL-17 and other 
cytokine and PRR pathways. The importance of these pathways in the 
pathogenesis of UC is supported by GWAS studies that report a signifi- 
cant association of UC with the NFKBIZ and TRAF3IP2 loci**”’. 

Despite a substantial overlap of affected driver genes between CAC 
and UC epithelia, their frequencies significantly differed between the 
two tissues, indicating that discrete mechanisms of positive selection 
exist in cancerous and non-cancerous tissues. Our results from the 
UC epithelium suggest that the mechanism that allows for positive 
selection under chronic inflammation may not necessarily contribute 
to neoplastic growth. On the contrary, it may impose strong negative 
effects on the development of cancer, as is evident from our finding that 
NFKBIZ mutations are extremely rare in CAC despite being pervasive 
innon-dysplastic samples of UC. 

Finally, one of the central implications of our study is the possibility 
that chronic inflammation can evoke a tissue response by eliciting the 
remodelling of tissues through positive selection of clones that acquire 
mutations—to the extent that the entire rectum becomes almost com- 
pletely reconstituted by a few clones that carry IL-17-pathway-related 
mutations. Chronicinflammation is highly prevalent among the human 
population and sucha mechanism of tissue remodelling through posi- 
tive selection may potentially modify the picture of disease in affected 
organs and tissues depending on the nature of inflammation. A com- 
prehensive understanding of such responses will be important for 
improving our knowledge of the pathogenesis of inflammatory diseases 
and for the development of novel therapeutics. 


Online content 


Any methods, additional references, Nature Research reporting sum- 
maries, source data, extended data, supplementary information, 


acknowledgements, peer review information; details of author con- 
tributions and competing interests; and statements of data and code 
availability are available at https://doi.org/10.1038/s41586-019-1856-1. 


1. Grivennikov, S. |., Greten, F. R. & Karin, M. Immunity, inflammation, and cancer. Cell 140, 
883-899 (2010). 

2. — Hunter, P. The inflammation theory of disease. The growing realization that chronic 
inflammation is crucial in many diseases opens new avenues for treatment. EMBO Rep. 
13, 968-970 (2012). 

3. Nathan, C. & Ding, A. Nonresolving inflammation. Cell 140, 871-882 (2010). 

4. — Jaiswal, S. et al. Age-related clonal hematopoiesis associated with adverse outcomes. 
N. Engl. J. Med. 371, 2488-2498 (2014). 

5. Martincorena, |. et al. Tumor evolution. High burden and pervasive positive selection 
of somatic mutations in normal human skin. Science 348, 880-886 (2015). 

6. Yokoyama, A. et al. Age-related remodelling of oesophageal epithelia by mutated cancer 
drivers. Nature 565, 312-317 (2019). 

7. Burisch, J., Jess, T., Martinato, M. & Lakatos, P. L. The burden of inflammatory bowel 
disease in Europe. J. Crohns Colitis 7, 322-337 (2013). 

8. Ungaro, R., Mehandru, S., Allen, P. B., Peyrin-Biroulet, L. & Colombel, J. F. Ulcerative 
colitis. Lancet 389, 1756-1770 (2017). 

9. Eaden, J. A., Abrams, K. R. & Mayberry, J. F. The risk of colorectal cancer in ulcerative 
colitis: a meta-analysis. Gut 48, 526-535 (2001). 

0. Kang, H. & Shibata, D. Direct measurements of human colon crypt stem cell niche 
genetic fidelity: the role of chance in non-Darwinian mutation selection. Front. Oncol. 3, 
264 (2013). 

1. Choi, C. R., Bakir, |. A., Hart, A. L. & Graham, T. A. Clonal evolution of colorectal cancer in 
IBD. Nat. Rev. Gastroenterol. Hepatol. 14, 218-229 (2017). 

2. Eto, A., Muta, T., Yamazaki, S. & Takeshige, K. Essential roles for NF-KB and a Toll/IL-1 
receptor domain-specific signal(s) in the induction of IkB-@. Biochem. Biophys. Res. 
Commun. 301, 495-501 (2003). 

3. Yamamoto, M. et al. Regulation of Toll/IL-1-receptor-mediated gene expression by the 
inducible nuclear protein IkB@. Nature 430, 218-222 (2004). 

4. Bruno, M.E.C., Frantz, A.L., Rogier, E. W., Johansen, F. E. & Kaetzel, C. S. Regulation of the 
polymeric immunoglobulin receptor by the classical and alternative NF-KB pathways in 
intestinal epithelial cells. Mucosal Immunol. 4, 468-478 (2011). 

5. Cao, A.T., Yao, S., Gong, B., Elson, C. O. & Cong, Y. Th17 cells upregulate polymeric Ig 
receptor and intestinal IgA and contribute to intestinal homeostasis. J. Immunol. 189, 
A666-4673 (2012). 

6. Bruno, M.E. et al. Correlation of biomarker expression in colonic mucosa with disease 
phenotype in Crohn's disease and ulcerative colitis. Dig. Dis. Sci. 60, 2976-2984 
(2015). 

7. Nakatsuka, Y. et al. Pulmonary Regnase-1 orchestrates the interplay of epithelium and 
adaptive immune systems to protect against pneumonia. Mucosal Immunol. 11, 1203-1218 
(2018). 

8. Mino, T. et al. Regnase-1 and Roquin regulate a common element in inflammatory mRNAs 
by spatiotemporally distinct mechanisms. Cell 161, 1058-1073 (2015). 

9. Iwasaki, H. et al. The IkB kinase complex regulates the stability of cytokine-encoding 
mRNA induced by TLR-IL-1R by controlling degradation of regnase-1. Nat. Immunol. 12, 
1167-1175 (2011). 

20. Reznik, B., Clement, S. L. & Lykke-Andersen, J. hnRNP F complexes with tristetraprolin and 

stimulates ARE-mRNA decay. PLoS One 9, e100992 (2014). 
21. Ramirez-Carrozzi, V. et al. IL-17C regulates the innate immune function of epithelial cells 
in an autocrine manner. Nat. Immunol. 12, 1159-1166 (2011). 


22. Robles, A. I. et al. Whole-exome sequencing analyses of inflammatory bowel disease- 
associated colorectal cancers. Gastroenterology 150, 931-943 (2016). 

23. Fujita, M. et al. Genomic landscape of colitis-associated cancer indicates the impact of 
chronic inflammation and its stratification by mutations in the Wnt signaling. Oncotarget 
9, 969-981 (2018). 

24. Din, S. et al. Mutational analysis identifies therapeutic biomarkers in inflammatory 
bowel disease-associated colorectal cancers. Clin. Cancer Res. https://doi.org/ 
10.1158/1078-0432.CCR-17-3713 (2018). 

25. Baker, A. M. et al. Evolutionary history of human colitis-associated colorectal cancer. 
Gut 68, 985-995 (2019). 

26. Jostins, L. et al. Host-microbe interactions have shaped the genetic architecture of 
inflammatory bowel disease. Nature 491, 119-124 (2012). 

27. Liu, J. Z. et al. Association analyses identify 38 susceptibility loci for inflammatory bowel 
disease and highlight shared genetic risk across populations. Nat. Genet. 47, 979-986 
(2015). 


Publisher's note Springer Nature remains neutral with regard to jurisdictional claims in 
published maps and institutional affiliations. 


© The Author(s), under exclusive licence to Springer Nature Limited 2019 


‘Department of Pathology and Tumour Biology, Kyoto University, Kyoto, Japan. 7Institute for 
the Advanced Study of Human Biology (WPI-ASHBi), Kyoto University, Kyoto, Japan. 
’Department of Gastroenterology and Hepatology, Kyoto University, Kyoto, Japan. 
4Department of Inflammatory Bowel Disease, Division of Surgery, Hyogo College of 
Medicine, Nishinomiya, Japan. ‘Department of Surgical Pathology, Hyogo College of 
Medicine, Nishinomiya, Japan. Department of Medical Chemistry, Graduate School of 
Medicine, Kyoto University, Kyoto, Japan. Department of Surgery, Kyoto University, Kyoto, 
Japan. °Gastroenterological Center, Department of Gastroenterological Surgery, Cancer 
Institute Hospital, Japanese Foundation for Cancer Research, Tokyo, Japan. Department of 
Therapeutic Oncology, Graduate School of Medicine, Kyoto University, Kyoto, Japan. 
Department of Gastroenterology and Hepatology, Kyorin University School of Medicine, 
Tokyo, Japan. "Department of Diagnostic Pathology, Kyoto University Hospital, Kyoto, 
Japan. Department of Diagnostic Pathology, Kyoto University, Kyoto, Japan. “Division of 
Molecular Oncology, National Cancer Centre Research Institute, Tokyo, Japan. “Laboratory 
of DNA Information Analysis, Human Genome Centre, Institute of Medical Science, The 
University of Tokyo, Tokyo, Japan. “Laboratory of Sequence Analysis, Human Genome 
Centre, Institute of Medical Science, The University of Tokyo, Tokyo, Japan. "Department of 
Advanced Diagnosis, Clinical Research Centre, National Hospital Organization Nagoya 
Medical Center, Nagoya, Japan. Research and Development Centre for Precision Medicine, 
University of Tsukuba, Tsukuba, Ibaraki, Japan. ‘Department of Immunology, Akita 
University Graduate School of Medicine, Akita, Japan. "Division of Experimental 
Therapeutics, Kyoto University, Kyoto, Japan. ?°DSP Cancer Institute, Sumitomo Dainippon 
Pharma, Osaka, Japan. “'Division of Hepato-Biliary-Pancreatic Surgery and Transplantation, 
Department of Surgery, Graduate School of Medicine, Kyoto University, Kyoto, Japan. 
22Department of Paediatric Surgery, Kanazawa Medical University, Ishikawa, Japan. 
23Department of Gastroenterology and Hepatology, Sapporo Medical University School of 
Medicine, Sapporo, Japan. “Kansai Electric Power Hospital, Osaka, Japan. Department of 
Medicine, Centre for Haematology and Regenerative Medicine, Karolinska Institute, 
Stockholm, Sweden. *e-mail: sogawa-tky@umin.ac.jp 


Nature | Vol577 | 9 January 2020 | 265 


Article 


Methods 


Data reporting 

No statistical methods were used to predetermine sample size. The 
experiments were not randomized and the investigators were not 
blinded to allocation during experiments and outcome assessment. 
Investigators were blinded to the genotypes of mice until the statisti- 
cal evaluation. 


Subjects and materials 

We enrolled a total of 647 patients who underwent surgery or endos- 
copy of the lower gastrointestinal tract at Kyoto University Hospital, 
Hyogo College of Medicine Hospital and Cancer Institute Hospital. 
Among these, 76 had been diagnosed with UC and the remaining 571 
were either diagnosed with other diseases such as colorectal polyp or 
cancer without associated colitis or turned out to be healthy individu- 
als. This study was approved by the ethical committees at Kyoto Uni- 
versity, Hyogo College of Medicine and Cancer Institute Hospital. We 
have complied with all relevant ethical regulations. Informed consent 
was obtained from all participants. Characteristics of the participants 
are summarized in Supplementary Table 1. 

Specimens from individuals without UC were endoscopically or surgi- 
cally obtained from non-dysplastic intestinal mucosa. After methylene 
blue staining, they were carefully assessed under a stereomicroscope 
to exclude samples involving dysplastic crypt lesions such as aber- 
rant crypt foci. All specimens obtained from patients with UC were 
pathologically evaluated using half of each specimen or its adjacent 
mucosa. Among them, six surgically resected rectum samples were 
comprehensively examined by 5-mm sections across the whole sam- 
pling area in 1-cm lattices. On the basis of Riddell’s classification’®, 
pathological evaluation was performed by two expert pathologists 
(Y.T. and T. Sakurai, or T.K. and S.H.) and samples were classified into 
four groups: non-dysplasia, low-grade dysplasia, high-grade dysplasia 
and cancer. 

To obtain single crypts and crypts in bulk, samples obtained by endo- 
scopic biopsy were trimmed to a size of approximately 4 mm’. From 
surgical specimens, samples were collected by punch biopsy of 2.5mm 
in diameter (4.9 mm’) (BP-25F, Kai Industries). After being separated 
from the submucosal layers by incubating samples in 20 mM ice-cold 
EDTA in phosphate-buffered saline (PBS) for 20 min, the epithelium 
was mechanically dissociated using fine forceps, and single-crypt or 
bulk-crypt samples were collected (Extended Data Fig. 2a). To obtaina 
geographically mapped cluster of crypts, two methods were used. For 
non-UC samples, the crypt structure within the mucosa was first fixed 
with acrylic adhesive from the epithelial surface, then the underneath 
cluster of crypts was peeled off en bloc from the submucosal layers 
after treatment with 20 mM EDTA. For UC samples, a gentle pressure 
was applied to the whole mucosal tissue from both sides to push up 
crypts after treatment with 20 mM EDTA. Finally, individual crypts were 
isolated and their position was recorded and numbered (Extended Data 
Fig. 2b). For the analysis of cancer samples, tumour cells were freshly 
obtained or dissected from formalin-fixed paraffin-embedded (FFPE) 
tissue under a stereomicroscope and the DNA was extracted. 

For WES, genomic DNA isolated from each single crypt was split into 
two aliquots, each of which was independently subjected to whole- 
genome amplification (WGA) with the REPLI-g Single Cell Kit (Qiagen) 
and used for either WES or subsequent validation. For other freshly 
obtained samples, genomic DNA and RNA were extracted simultane- 
ously using the AllPrep DNA/RNA Micro Kit (Qiagen). DNA was extracted 
from FFPE cancer samples using the GeneRead DNA FFPE Kit (Qiagen). 
DNA from peripheral blood or tumour-free muscularis propria that 
was macro-dissected from FFPE cancer samples was extracted using 
the Gentra Puregene Kit (Qiagen) or GeneRead DNA FFPE Kit (Qiagen), 
respectively, and used as a germline control. For targeted capture 
sequencing of sCRCs, genomic DNA was extracted from cryo-preserved 


samples using the AllPrep DNA/RNA Mini Kit (Qiagen). Sample informa- 
tion is summarized in Supplementary Tables 2 and 3. 


Whole-exome sequencing 

WES libraries were prepared using SureSelect Human All Exon VS (Agi- 
lent Technologies) or xGen Exome Research Panel (IDT), followed by 
sequencing of enriched exon fragments on a HiSeq 2500 or NovaSeq 
6000 system (Illumina) in 100-150-bp paired-end mode as previously 
described”. The target depth was 100~, and the actual depth was 133x 
(60-293x). The mean depth in germline controls from 122 individuals 
was 140x(55-204x). Mutation calling was performed using the Geno- 
monz2 pipeline (v.2.6), as previously described’. In brief, sequencing 
reads were aligned to the human reference genome (GRCh37) using 
Burrows-Wheeler Aligner (v.0.7.8) with default parameter settings. 
PCR duplicates were eliminated using biobambam (v.0.0.191). Somatic 
mutations were detected by eliminating polymorphisms and sequenc- 
ing errors. To achieve this, GGnomonz2 first discards any low-quality, 
unreliable reads and variants according to the following criteria: (i) 
mapping quality < 20, (ii) base call quality < 15. After further excluding 
those variants that are not supported by (i) a sufficient number of reads 
(total reads > 8 and variant reads > 3); (ii) variant allele frequencies 
(VAFs) > 0.05 (for fresh and FFPE samples), > 0.25 (for single-crypt WGA 
samples) and < 0.02 (for germline control); (iii) astrand ratio not equal 
to 0 or1, the remaining variants were interrogated to establish whether 
they were observed at significantly higher VAFs than expected for 
errors (P<10*, P<10°° and P<10~“ for fresh, FFPE and WGA samples, 
respectively). Significance was evaluated by the EBCall algorithm®”°, 
onthe basis of an empirical distribution of VAFs as determined using 
WES data of non-paired peripheral blood samples (n= 20). Putative 
germline variants were also excluded by comparing VAFs with matched 
controls using the Fisher's exact test (P< 107, P< 10‘ and P<10° for 
fresh, FFPEand WGA samples, respectively), which also eliminated any 
remaining sequencing errors. The monoclonal origin of single-crypt 
cells, which are derived from a single stem cell at a given time, was 
evident from the normal distribution of VAFs around 0.5 (Extended 
Data Fig. 3a, b). 

For mapping samples of clusters of crypts and surgically resected 
rectums, each of the mutations detected in a sample was interro- 
gated in all of the remaining samples. If a variant detected in one or 
more samples by the above mutation call was also present (VAF > 0.1 
and > 0.05 for single-crypt mapping and rectum mapping, respectively) 
in another sample within the same mapping specimen, the variant was 
considered to be true positive, because it has a high prior probability 
for true positivity. 


Validation of detected mutations 

Validation of mutations detected by WES in fresh and FFPE samples 
was performed using PCR-based deep (>500x) sequencing as previ- 
ously described”, in which 708 single-nucleotide variants (SNVs) 
and 28 indels were randomly selected and evaluated. A mutation was 
considered to be validated when (i) the sequencing depth was > 500x 
in both test and germline control samples; (ii) the VAF in the test 
sample was 5 times higher than that in the corresponding germline 
control sample; and (iii) the VAF in the test sample was high enough 
(= 0.01) to be captured by the lower threshold in WES (0.05)°. The 
overall validation rate was 99.0% for fresh samples (390/394 muta- 
tions) and 97.4% for FFPE samples (333/342 mutations). Validation 
of mutations detected in WGA DNA of single crypts was performed 
using WES of replicated samples (Extended Data Fig. 2c), in which 
126 SNVs and indels detected in 3 single crypts were evaluated. A 
mutation was considered to be validated when (i) the sequencing 
depth in the replicated sample was = 8; (ii) the VAF in the replicated 
sample was > 0.2. The validation rate of WGA DNA was 98.4% (124/126 
mutations). The results of validation sequencing are summarized in 
Supplementary Table 4. 


External dataset 

WES data (bam files) of paired tumour and germline control samples 
from patients with sCRC were downloaded from the TCGA data portal. 
Bam files from the TCGA were converted to fastq format using bioba- 
mbam and processed with the same pipeline for mutation calling as 
was applied to the fresh samples that were obtained in this study (see 
above). For accuracy, sequencing data generated from tumours using 
WGA were excluded. Hypermutated tumours (> 12 mutations per 10° 
bases) were also excluded from the comparison of driver genes between 
non-dysplastic samples from patients with UC and TCGA sCRC sam- 
ples. Lists of mutations in 65 cases of colitis-associated cancer (CAC) 
in individuals with UC were available from four previous publications 
of UC-CAC WES studies”. Samples included in this study are sum- 
marized in Supplementary Table 2. 


Analysis of copy-number alterations 

CNAs were evaluated from WES data using our in-house pipeline, 
‘CNACS”*. The fractions of genomic regions with CNAs were compared 
between non-dysplasia, dysplasia and CAC in patients with UC. For 
expanded clones that were observed in more than one sample, the 
average fraction with CNAs was used. 


Estimation of mutant-cell fractions 

Precise estimation of the fraction of cells that have asomatic mutation 
(mutant-cell fraction (MCF)) might be complicated when the muta- 
tioninvolves a gene located onthe chromosomal segment that shows 
a CNA. Most mutations detected by WES in isolated non-dysplastic 
samples were outside of regions with CNAs. Thus, for those mutations 
(7,024/7,072), the MCF was simply estimated from their VAFs as follows: 


MCF =2 VAF (for autosomes and the X chromosome in females); or 
MCF = VAF (for the X chromosome in males). 


By contrast, when the mutated locus does show an abnormal copy 
number or allelic imbalance, estimation of the MCF depends onseveral 
factors that may not be inferred in a deterministic way—particularly in 
cases in which a sample contains multiple clones with different com- 
positions of mutant alleles. However, for a mutation in a region that 
exhibits copy-number loss or allelic imbalance, using the following 
calculation does not lead to overestimation of the MCF. Thus, we esti- 
mate the MCF of such mutations (31/7,072) as follows: 


MCF =2 x VAF — %UPD/100 (for uniparental disomies; UPDs); or 
MCF = VAF x (2- %Del/100) (for deletions), 


where %UPD and %Del denote the estimated fraction (%) of the respec- 
tive abnormality. Other mutations associated with copy-number gain 
(17/7,072) were excluded from the MCF analysis in Extended Data Fig. 5. 


Analysis of significantly mutated (driver) genes 

Significantly mutated genes, or driver genes, were investigated onthe 
basis of dN/dS, which was calculated for all genes using the dNdScv 
package in R. For surgically resected rectum specimens, for which 
samples were obtained according to 1-cm lattices, mutations fromthe 
same clone could be detected in multiple samples. Such observations 
of multiple mutations belonging to the same clone needed be corrected 
before the analysis. To achieve this, we first clustered samples into 
sets in which samples shared four or more mutations with VAF > 0.25 
(Supplementary Table 5), or otherwise treated them as single samples. 
Each mutation detected ina given case was included in either asample 
set or a single sample that had the highest VAF of the mutation. After 
applying dNdScv to the mutations in non-dysplasia (36 sample sets 
including 152 samples, and 243 single samples) and non-hypermutated 
CAC (56 samples including our samples and publicly available data for 


which information on synonymous mutations was present) in patients 
with UC separately, those genes that were significantly mutated (dN/ 
dS>1,q<0.05) and found in>4% of non-dysplasia or CAC samples were 
considered as drivers (14 genes in non-dysplasia; 6 genes in CAC) (Sup- 
plementary Table 6). Driver genes were also analysed using MutSigCV, 
which confirmed 10/14 and 4/6 genes as being significantly mutated 
(q< 0.05) innon-dysplasia and cancer in patients with UC, respectively. 

In UC non-dysplasia, we investigated loss-of-function mutations in 
genes annotated as cytokine receptor (n = 108) or PRR (n = 33) genes. 
Through WES of 399 non-dysplastic samples from 28 patients with UC, we 
identified a total of 2,575 truncating mutations (including those affecting 
splice sites). The significance of the mutations in each gene was evaluated 
by calculating the type-I error rate, assuming a Poisson distribution with 
observed mutation rate for truncating mutations (2,575/33,479,141 bp) 
and the length of each gene. Multiple testing was corrected using the 
Benjamini-Hochberg method. Genes with g < 0.05 were considered 
to be significantly mutated. We also calculated dN/dS values for these 
141 genes. Information onthe cytokine receptor and PRR genes is sum- 
marized in Supplementary Table 7. Genes with a protein-truncating 
mutation that affects acomponent of the IL-17 signalling pathway were 
considered to be potential drivers. In total, 20 genes were considered to 
be related to positive selection in UC non-dysplasia (Fig. 2b). 


Targeted capture deep sequencing of driver genes 

Targeted capture deep sequencing of 525 non-hypermutated sCRCs 
froma Japanese cohort was performed according to the manufacturer’s 
protocol, using acustom bait library (xGen Predesigned Gene Capture 
Pools; IDT) that was designed to capture 16 genes that include driv- 
ers in either UC non-dysplasia or CAC (Supplementary Table 6). After 
hybridization capture, the enriched DNA fragments were sequenced 
ona HiSeq 2500 system. The mean depth was 1,879 (364-2,545x). 
Relevant somatic mutations were called using the Genomon2?2 pipe- 
line and were filtered by excluding (i) synonymous SNVs; (ii) EBCall 
Pvalue>10™*: (iii) VAF < 0.02; (iv) strand ratio =0 or 1; (v) known variants 
listed in SNP databases (1000 Genome Project, Exome Sequencing Pro- 
ject 6500, Human Genetic Variation Database and Exome Aggregation 
Consortium). Candidate mutations were further filtered by removing 
missense SNVs with a VAF of 0.4-0.6, except for SNVs that have 5 or more 
entries in the Catalogue of Somatic Mutations in Cancer (COSMIC) 
(v.70). Detected mutations are summarized in Supplementary Table 8. 


Mutational signature 

Mutational signatures in 29,409 SNVs—consisting of 1,537 non-UC- 
derived SNVs detected by WES of single crypts (n = 43); 16,428 SNVs 
of non-dysplastic samples from patients with UC, which were either 
directly detected from WES of single crypts (n = 58) or assigned to 
PyClone-imputed clones from bulk-crypt samples (n=183); 2,022 SNVs 
assigned to clones from dysplastic samples from patients with UC 
(n=13); and 9,422 SNVs detected in non-hypermutated CAC (n= 92)— 
were extracted using the R package pmsignature. Extracted signatures 
were compared with the COSMIC signatures using the R package decon- 
structSigs. Signature A was confirmed to be highly similar to COSMIC 
signature 1 and signature B to COSMIC signature 17. The frequencies 
of signature B were compared between non-dysplastic samples from 
control individuals, and non-dysplastic, dysplastic and CAC samples 
or clones from patients with UC. Samples or clones with fewer than 
10 mutations were excluded from the analysis. 


Detection of crypt expansion in the colorectal epithelium in 
patients with UC 

To identify independent clones for isolated bulk-crypt samples (n=133) 
from patients with UC, clonal composition in these samples was inter- 
rogated using PyClone (v.0.13.0), in which a set of clones that have a 
discrete set of mutations were imputed together with their estimated 
cellular prevalence. For samples for geographical mapping in the 
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rectum (n = 294; 49 samples per individual on average (range, 41-57) 
from six patients with UC), we first identified those samples that shared 
four or more mutations with VAF > 0.25, which were considered to 
contain a clone expanded fromthe same ancestral crypt and therefore 
analysed as a group using PyClone. For other geographical samples, 
samples were analysed as isolated samples (Supplementary Table 5). 
When the maximum cellular prevalence of mutational clusters is more 
than 0.5, the Pigeonhole principle can be used to decide whether the 
dominant cell fraction includes other minor fractions of mutated cells 
or not. Thus, only samples with a maximum cellular prevalence > 0.5 
were included for the analyses of expanded crypts, and minor muta- 
tional clusters were included only when the sum of such dominant 
cell fractions and minor cell fractions exceeded 1.0 (Extended Data 
Fig. 1d). Driver mutations, and their clone size, that were detected in 
individual samples in the geographical mapping of six patients with 
UC are summarized in Supplementary Table 9. 


Phylogenetic analysis 

To analyse the evolutionary history of clusters of crypts by microscale 
mapping, genotype information at every locus that was mutated in 
one or more samples was used to construct a phylogenetic tree. The 
maximum parsimony method using the Subtree-Pruning-Regrafting 
algorithm was performed by MEGA X, and the most-parsimonious 
tree from 1,000 replicates was obtained. Time points for the onset 
of UC in phylogenetic trees were estimated from the age of UC onset 
and the rate of mutation rate in control individuals (Fig. 1a). The age 
of the most-recent common ancestral crypt in UC trees was estimated 
from the time point of UC onset, the height of the first branch point 
and the average number of mutations in analysed crypts within each 
patient with UC (assuming a linear accumulation of mutations after 
the onset of UC). 

Inthe multi-sampling analysis for rectum samples, phylogenetic trees 
were constructed for those clones observed in two or more samples. 
The assignment of different subclones to distinct clonal structures was 
accomplished according to the Pigeonhole principle, on the basis of 
the cellular prevalence of each mutational cluster as estimated from 
PyClone (as described previously’). In brief, when and only when the 
sum of their cellular prevalence exceeded 1.0, the two clusters were 
thought to be inthe same clonal structure. For clusters in which one hada 
smaller cellular prevalence, the cluster was assigned as a subclone of the 
other cluster with larger cellular prevalence. All clusters for which clonal 
structures were undetermined were excluded from further analyses. 
Representative phylogenetic trees are shown in Extended Data Fig. 7c-h. 


Estimation of the rate of crypt fission 

In control individuals, the rate of crypt fission (fissions per crypt per 

year) before 20 years of age was estimated on the basis of the number 

of ancestral crypts at birth and the number of crypts at 20 years of 

age in phylogenetic trees in three clusters of crypts, according to the 

following formula: 
Fission rate = (log,(N, 


, at 20 years old) — log,(N-,yp¢at birth))/20, 


Typ ryp' 


where N,,yp,is the number of crypts and where we assumed that: (i) every 
crypt had <1 somatic mutation at the time of birth; (ii) the intestine 
continues developing until 20 years of age, during which time crypts 
will not be lost. 

The fission rate after 20 years of age was estimated on the basis of 
the number of branches that occur after 20 years of age to the sampling 
time point, according to the following formula: 


Fission rate= (Noranch point! Nerypt)/ Yarter 20» 


where Npranch point iS the number of branch points after 20 years of age 
(12.2 mutations) and Yre,29 denotes years, with 20 subtracted from 


the age of the subject. We assumed that: (i) colon size does not change 
after 20 years of age; (ii) crypt fissions occur only to balance the physi- 
ological deficits. 

In patients with UC, the fission rate was estimated on the basis of the 
rate of mutation after disease onset and the time point of the most- 
recent common ancestral branch point of analysed crypts, according 
to the following formula: 


Fission rate = logy(Nerypt)/Yartermrcar 


where Vrermrca denotes the number of years after the most-recent com- 
mon ancestral (MRCA) branch point to the age of the subject, and where 
we assumed that: (i) the rate of mutation after the disease onset was 
constant; (ii) crypts were not lost after the time point of the most-recent 
common ancestral crypt. 


Comparison between the mutation frequency in CACs and the 
mutated fraction in UC epithelium 

There was a substantial discrepancy in mutation frequencies in driver 
genes between CAC and non-dysplastic samples from patients with 
UC. Thus, to evaluate the positive or negative role of mutations in 
each driver gene in colitis-associated carcinogenesis, we compared 
the mutation frequency of each gene in 99 samples of CAC (Fig. 4a) 
with that expected from the fraction of the mutated area in UC colo- 
rectal epithelium under the null hypothesis H,: the gene mutation has 
no selective effect on the development of CAC from background UC 
colorectal epithelium. First, for each driver gene, the mean fraction of 
mutated area (Fuc epithelia) in non-cancer samples from patients with UC 
(n = 432) was calculated by using the model of cellular prevalence in 
PyClone. Then, the 95% confidence interval (95% Cl) of the expected 
mutation frequency in CACs on the basis of the mutated fraction in UC 
colorectal epithelium was calculated for each gene, from a binomial 
distribution, B(¢, 99) using Fic epithetia aS. an estimated value of prevalence 
(t) of mutated area. For each driver gene, the significance of the differ- 
ence between the observed and the expected mutation frequency in 
CAC was calculated by two-sided binomial test. Multiple testing was 
corrected using the Benjamini-Hochberg method. Detailed results 
are summarized in Supplementary Table 10. 


Cell culture 

HeLa cells were maintained in DMEM (08459-64, Nacalai Tesque) sup- 
plemented with 10% FBS and 100 pM 2-mercaptoethanol (21418-42, 
Nacalai Tesque). HCT116, HT29, SW480, DLD1 and LoVo cells were 
maintained in DMEM supplemented with 10% FBS. 


Introduction of wild-type and mutant ZC3H12A 

cDNAs of full-length human wild-type ZC3H12A (encoding the RNase 
ZC3H12A, or Regnase-1) and a C-terminal truncation mutant (W543*) of 
this gene were inserted into pcDNA3.1(+) (V79020, Thermo Fisher Scien- 
tific) together with the N-terminal Myc-tag sequence. The $438L mutant 
ZC3H12A was created using the QuikChange Lightning Site-Directed 
Mutagenesis Kit (210519, Agilent). Plasmid DNA was transfected into 
HCT116 cells using Lipofectamine 2000 (11668500, Thermo Fisher 
Scientific) according to the manufacturer’s protocol. 


Degradation assay of ZC3H12A upon cytokine stimulation 

HCT116 cells were stimulated with 50 ng mI recombinant human 
IL-17A (570502, BioLegend) or10ng mI“IL-1B (201-LB-005, R&D Systems). 
Cells were lysed in the lysis buffer (20 mM Tris-HCl (pH = 7.4), 150 mM 
NaCl, 0.5% NP-40, cOmplete Mini (EDTA-free) (11836170001, Roche), 
200 units per ml RNaseOUT (10777019, Thermo Fisher Scientific)). 
Protein samples were resolved on a 7.5% PAGE gel (E-T7.5L, ATTO) and 
transferred to Immun-Blot PVDF membranes (1620177, Bio-Rad). The 
following antibodies were used for immunoblot analysis: monoclo- 
nal anti-c-Myc (M4439, Sigma-Aldrich), monoclonal anti-beta-actin 


(sc-47778 HRP, Santa Cruz) and anti-mouse IgG HRP (NA9310-1ML, GE 
Healthcare). Band intensities were analysed using Amersham Imager 
600 Analysis Software (GE Healthcare). For the analysis of protein/ 
mRNA ratio, total RNAs were isolated using TRIzol Reagent (15596018, 
Thermo Fisher Scientific) and reverse-transcribed using ReverTra Ace 
qPCRRT Master Mix with gDNA Remover (FSQ-301, TOYOBO) according 
to the manufacturer’s protocol. The cDNA fragments were amplified 
with PowerUp SYBR Green Master Mix (A25742, Thermo Fisher Scien- 
tific). Primers are described in Supplementary Table 11. 


Degradation assay of ZC3H12A under BTRC inhibition 

HeLa cells were infected with CS-TRE-Flag-BTRCAF-PRE-Ubc-tTA- 
12G lentivirus, and treatment with doxycycline (Dox) induced the 
expression of the dominant-negative form of BTRC (BTRC(AF))”. 
After administration of 100 ng ml“ Dox, cells were stimulated with 50 
ng ml‘ recombinant human IL-17A (570502, BioLegend) or 10 ng mI 
IL-1B8 (201-LB-005, R&D Systems). Cells were lysed and protein sam- 
ples were prepared as described above. The following antibodies were 
used for immunoblot analysis: polyclonal anti-ZC3H12A (HPA032053, 
Atlas Antibodies), polyclonal anti-Flag (F7425, MERCK), monoclonal 
anti-beta-actin and anti-rabbit IgG HRP (NA9340V, GE Healthcare). 


Analysis of the effect of NFKBIZ on growth of cancer cells 

sgRNAs targeting NFKBIZ were designed, and knockout efficiency was 
confirmed by Guide-it Mutation Detection Kit (Z1448N, TAKARA). Five 
human CRC-derived cell lines (HT29, SW480, DLD1, HCT116 and LoVo) 
were infected witha lentivirus library generated from pL-CRISPR.EFS. 
GFP (Addgene 57818), containing SpCas9 and sgRNAs that target NFK- 
BIZ(sgNFKBIZ #1-6) or non-targeting control sgRNAs (non-targeting 
#1-4) ata multiplicity of infection of 0.2 (Extended Data Fig. 10a). GFP- 
positive cells were sorted using a FACSAria IIlu (BD Biosciences) three 
days after infection. Half of the sorted cells were used for DNA extrac- 
tion and the remaining half were cultured in 6-well plates. After four 
weeks of culture, 50 ng of genomic DNA extracted from each culture 
was subjected to PCR amplification of integrated sgRNAs, together 
with the DNA from pre-culture cells, for amplicon sequencing using 
NextSeq 550 (Illumina). The fold change (FC%"4) of the frequency of 
sgRNA-containing reads between pre-culture (Freq’?*“/"*) and post- 
culture (Freq’?®™4/"s') samples was calculated and normalized for the 
mean value of control sgRNAs as follows: 


FCSeRNA = Freqs2XA/Post/F req seRNa/Pre 
Normalized FC®*™4 = FCS8"%“/mean FC for control sgRNAs. 


Significant differences between sgRNAs that target NFKBI/Z and non- 
targeting sgRNAs were calculated by two-sided Mann-Whitney Utest. 
PCR primers and sgRNA sequences are provided in Supplementary 
Tables 11 and 12, respectively. 


Animal experiments 

Allin vivo experiments in animals were approved by Kyoto University 
Animal Care and Use Committee. Genetically engineered C57BL/6 mice 
were used. Vill-cre mice (004586) and R26-LacZ mice (003309) were 
obtained from The Jackson Laboratory. Nfkbiz flox mice” (RBRC06410) 
were obtained from Riken BRC. Vill-cre™ mice” were a gift from 
S. Robine. 


Cellular response to inflammatory stimulation in colon 
organoids 

Colon organoids were generated from Nfkbiz“ Vill-cre (CKO) mice 
and Nfkbiz (control) mice and were maintained in basement mem- 
brane matrix (Matrigel) (354234, Corning) with 50% L-WRN condi- 
tioned medium (CM) as previously described**. Control organoids 
were stimulated with 50 ng ml“ IL-1B (575102, BioLegend), 50 ng mI 
IL-4 (574304, BioLegend), 50 ng mI IL-13 (575904, BioLegend), 50 ng 


mI IL-17A (421-ML-025, R&D Systems), 50 ng mI IL-19 (2915-IL-025, 
R&D Systems), 50 ng mI IL-20 (1204-ML-025, R&D Systems), 50 ng 
mI IL-24 (7807-ML-010, R&D Systems), 50 ng mI“ IL-31 (210-31-10UG, 
PeproTech), 50 ng mI oncostatin M (762804, BioLegend), 50 ng mI 
TNFa (575202, BioLegend), 50 ng mI lymphotoxin a1/B2 (9968-LY- 
025/CF, R&D Systems), 10 pg mI LPS (L6529-1MG, Sigma), 2.5 uM CpG 
oligodeoxynucleotides (ALX-746-001-C100, Enzo) and PBS (control) for 
Lhor 24h, followed by RNA extraction. The expression levels of Nfkbiz 
and Gapdh were measured by quantitative PCR. The expression levels 
of Nfkbiz, Pigrand Gapdh of organoids stimulated with IL-17A were also 
measured by quantitative PCR at O, 1, 4.and 24 h after stimulation. For 
each gene, the expression level was calculated as a ratio relative to 
Gapdh. Primer sequences are described in Supplementary Table 11. 


Competition assay of colon organoids from Nfkbiz’ and 
Nfkbiz’” mice 

An equal number of colon organoids derived from Nfkbi“Vill-cre 
(Nfkbiz’) and Nfkbiz™ (Nfkbiz‘*) mice were co-cultured in 50% L-WRN 
CM with or without 50 ng mI‘ IL-17A stimulation. Organoids were 
passaged in the ratio of 1:5 every 5 days. Genomic DNA was extracted 
at every passage, followed by measurement of Nfkbiz and Nfkbiz* 
alleles using quantitative PCR. For each passage, the ratio of Nfkbiz to 
Nfkbiz' alleles was calculated. Then, the fold change from the ratio at 
the start of culture was calculated for every passage. Primer sequences 
are described in Supplementary Table 11. 


Clonal expansion of Nfkbiz-knockout colorectal epithelium in 
chronic DSS-induced colitis 

R26'°"™N¢kbiz™ Vill-cre®® (cKO) mice and R266°2""Nfkbiz”™ Vill-cre® 
(control) mice were co-housed after weaning. At 6 weeks of age, Cre- 
mediated recombination was induced by intraperitoneal injection of 2 
mg of tamoxifen (T5648, SIGMA) twice a day for 2 consecutive days. At 
8to 9 weeks of age, experimental chronic colitis was induced by three 
cycles of 1% DSS (molecular weight = 36,000-50,000 Da; MP Biomedi- 
cals) (5 days DSS, followed by 16 days water). During three cycles of DSS, 
body weight was measured three times a week. Mice were killed before 
or after three cycles of DSS and colon lengths were measured. The colon 
was opened longitudinally and mounted, then stained by X-gal (9031, 
Takara) according to the manufacturer’s protocol. Images of the colon 
were taken under a stereomicroscope and LacZ-positive areas from 
middle to distal colon were measured using Image] software. Histologi- 
cal scores of colitis in which epithelial damage and inflammatory-cell 
infiltration were scored from O to 12 as previously described™ were 
counted on 5 randomly selected HPFs of H&E-stained slides in each 
mouse. The mean values of the scores were used for the analysis. 


Transcriptome analysis of colorectal epithelial cells in DSS- 
induced colitis 

Colorectal epithelial cells were obtained from rectum mucosa from 
8-week-old Nfkbiz“'Vill-cre (cKO) and Nfkbiz™ (control) mice that 
underwent 4 days of 1% DSS treatment. RNA was extracted from 
epithelial cells in distal colon by using the AllPrep DNA/RNA Micro 
Kit (Qiagen). Libraries for RNA sequencing were prepared using the 
NEBNext Ultra RNA Library Prep kit for Illumina (New England Biolabs) 
and subjected to sequencing using a HiSeq 2500 instrument (Illumina) 
with a standard 125-bp paired-end protocol. 

The sequencing reads were aligned to the mouse reference genome 
(mm{9) using STAR (v.2.5.3). Reads on each gene were counted with fea- 
tureCounts (v.1.5.3) from the Subread package, and the edgeR package 
was used to identify the differentially expressed genes with a false dis- 
covery rate (FDR; q value) threshold of 0.05. The analysis was performed 
in genes that were expressed at >1 counts per million (CPM) in two or 
more samples, and generalized linear models were used to compare 
gene expression data. Gene set enrichment analysis (GSEA) (v.2.2.4) was 
used to determine the sets of genes that were differentially expressed in 
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NfkbizcKO mice compared with control mice. For GSEA, publicly avail- 
able gene sets, including downregulated genes in /l17rc-knockout cells® 
and downregulated genes in Nfkbiz-knockout cells” 3° °8, were used. 


Mouse model of cancer induced by AOM-DSS 

Nfkbiz“ Vill-cre (cKO) and Nfkbiz™ (control) littermates were co- 
housed after weaning. At 8 weeks of age, 12 mg per gram body weight 
of AOM (A5486, SIGMA) was injected into the peritoneum and 3 cycles 
of 2% DSS were administered (5 days DSS, followed by 16 days water). 
After completion of 3 cycles of DSS, mice were killed and the colons 
were opened longitudinally. The size and number of tumours in the 
colon were measured. Induced tumours were evaluated histologically 
by H&E staining, B-catenin and Ki-67 immunohistochemistry as previ- 
ously described”. Body weight, colon length and histological score of 
colitis were also measured as described above. 


Statistical analysis 

Statistical analyses were performed using R (v.3.5.1). All P values were 
calculated by two-sided analysis unless otherwise specified. The Stu- 
dent’s t-test, paired t-test, Fisher’s exact test or Mann-Whitney Utest was 
used for group comparisons. Multiple testing was corrected based on 
the Benjamini-Hochberg method. The linearity of the number of muta- 
tions in single crypts (n = 43) from 22 control individuals with age was 
evaluated on the basis of the Pearson’s correlation coefficient ina linear 
regression model that assumed zero intercept, because the number of 
somatic mutations among an exome regionat 0 years of age is assumed 
to be nearly O. By the direct observation of mutations without culturing 
cells, the estimated rate of mutation inthe colorectal epithelium was 0.61 
mutations per exome per year in control individuals, whichis compara- 
ble to that reported previously”. For the analysis of the rate of mutation 
of single crypts from 18 patients with UC, we included one single crypt 
per subject that showed the maximum number of mutations inthe same 
patients with UC and all crypts (n=43) from 22 control individuals. The 
rate of mutation of single crypts after UC onset was estimated by alinear 
regression model that incorporated both age of UC onset and disease 
duration in which control individuals were treated as having a disease 
duration of 0, which showed significant improvement from a model 
that incorporated age of individuals alone (by ANOVA) (Supplementary 
Table 13). For comparison between the left and right side of the colon, 
the analyses were confined to those samples from patients for whom 
samples were taken from both the left and right side of the colon. In 
each patient, mutations or MCFs were averaged for multiple samples 
inthe left and right side of the colon before performing a paired t-test. 
The significance of the difference of maximum MCF of isolated bulk- 
crypt samples between control individuals and patients with UC was 
tested by two-sided Mann-Whitney Utest. For the linearity of maximum 
MCF in bulk-crypt samples with UC duration, alinear regression model 
without assuming zero intercept was applied, because a small number 
of mutations with high MCF can be observed as somatic mosaicism even 
incontrol individuals. The significance of the correlation of UC duration 
with maximum MCF was tested using an F test. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


All of the WES and RNA-seq data have been deposited in the 
European Genome-phenome Archive under accession numbers 


EGAS0O0001003801 and EGASO0001003802, respectively. Data for 
Figs. 1-4 and Extended Data Figs. 3-10 are available as Source Data. 
Allother data are available from the corresponding author on reason- 
able request. 
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Extended Data Fig. 1| Study design. a-c, Summary of sampling types, 
histology of samples, subjects and sequencing platforms for single crypts (a), 
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Extended Data Fig. 2| Sampling methods. a, For preparation of epithelial 
samples, after treating intestinal mucosa with 20 mM EDTA in PBSat 4 °C for 

20 min, the epithelium was dissociated from the lamina propria. Then, asingle 
crypt or epithelium (for bulk crypts) was collected. b, For geographical 
tracking of clones inacluster of single crypts in colorectal mucosa from 
control individuals, acrylic adhesive was put on the mucosal surface, followed 
by atreatment with 20 mM EDTA in PBS. After peeling off the epithelium upside 
down, single crypts that were attached to the adhesive were collected one by 
one under astereomicroscopeand their positions recorded (top). Toisolate 
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crypts from patients with UC, after colorectal mucosa was treated with 20 mM 
EDTA in PBS, a gentle pressure was applied to the whole mucosal tissue from 
both sides to pushup crypts, and each single crypt was isolated in a manner 
similar to that described above. c, For single-crypt sequencing, genomic DNA 
was extracted from single crypts by direct cell lysis and heat denaturation and 
split into two aliquots, each of which was subjected to WGA using ®29 DNA 
polymerase. One of the two WGA products was subjected to WES, and the other 
was stocked for validation. 
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WES in single crypts from control individuals (10 out of 43; a) and patients with crypts (n=58) from non-dysplastic UC samples. The number of mutations in 
UC (10 out of 58; b). c, Frequency of driver mutations among single crypts from each single crypt is shown above, and the disease duration and age of subjects 
control individuals (n= 43) and patients with UC (n=58). Genes witha are shown below. 

significant difference in mutation frequency between non-UC and UC crypts 
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images (left or top). The colours of the branches in the trees correspond to with UC (n=3, bottom). 


a Non-UC UC 


4 NFKBIZ 4 PIGR 4 ZC3H12A 4 TRAF3IP2 4 HNRNPF & ARID1A 4 KRAS 4 TP&3 
Colon segment: = Cecum ® Ascending ® Transverse ®™ Descending = Sigmoid = Rectum 


15 . i % pp ge aa 
| tT aEL LY if 
ae Lo EK 
0.754 © = 4 ‘ i. .3 i eS 
° ° ° ° g 8 8 
ts ° ° : 3 4 7 “3 fe 
9 0.5 5 ° ° o & aan ° o «8% Be % 0 e ° Seg 
= ° ° ° ° oe 2, ° 8 
°° 20 ° oof ° ° 8 f° o 
aie e ° on ng % Fy ag 
° ° °° ar) ge % Gg 
0.25 4 2° a rr a) 7 & P| 
: oe Seg 8, 08 8 om Seo om, & 10° U °8 
posers  sgese eg oP FPL cab ay esihsd 3 agi 
a ° > 0 
Colon segment a a | a A a a a a a a Se i 
NFKBIZ 
fee I | MUL I A I 
ZC3H12A ll lll 
TRAF3IP2 I Toll 
VARIDIA th I Tl A 
KRAS I | I 
TP53 | 
Other drivers Ww I ee Wa ial 
S 2 180+ 
5 2 120; ly 
pe Ud lov at bl 
ee ee eda a 
- 307 
o£ = 204 
a 8 pono! 
a "5! poeeeeeeeeleell 
100 5 
a TL 
ZS 
Me ee 
1205 
b © UC (n= 22) e © UC (n= 22) He rr ee dee 
© Non-UC (n= 23) 2 4, O Non-UC(n=23) a oO 
BS S& 1004 
2 100 5 ° ° % = 
S * 2a ° s 
2 nw , — 8 - E) £ 
£3 g0- ° ts aoe ° 77 E804 go 
35 ’ Es - 5 = 
EB ° cn" Bog 6 oe o 60 - g 
55 601 ona ea le @ te Og 7 = 0504 
Q ° °, = og ie] 
2 a Be of ° - 2 o 
5 404 oO, © gEz04ig -- ° 0 & 404 3 
58 oo Ee as : & = 0.25 | 
a g 8° R?= 0.46 8 2 
oo ’ R?=0.77 5 ° p=2.6 x 107 g 
< 0-8 T T T T ‘ 1 g O°, T T T T T 1 o- pia — —— lh < 0.00 - 
0 5 10 15 20 25 30 0 5 10 15 20 25 30 Right Left 
After disease onset (y) After disease onset (y) Side of colon Side of colon 
Extended Data Fig. 5 | Mutations detected in colorectal epithelium from WES of bulk-crypt samples within each individual is plotted against the 
control individuals and patients with UC. a, MCF of mutations that were duration of UC. Control individuals are plotted at zero onthex axis. Regression 


detected with WES inisolated bulk-crypt samples from non-UC (n=136) andUC _ lines with (b) or without (c) assuming zero intercept are shown with R’ values. 
(n=133) colorectal epithelia. Samples belonging to each subject are separated The calculated Pvalue inc represents the significance of the correlation 


by blank columns. MCFs of driver mutations are indicated by coloured between the duration of UC and the average value of maximum MCFs (F test). 
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Extended Data Fig. 6 | Functional analysis of driver genes in UC mucosa. 
a, Distribution of driver gene mutations in non-dysplastic UC samples 
(58 isolated single crypts and 399 bulk crypts, top) and CAC samples (n=99, 


bottom). Amino acid sequences of different species around hotspot mutations 
of HNRNPF are shown. Completely conserved amino acids across all species are 


indicated by asterisks. b, Expression levels of Nfkbiz in colon epithelial 


organoids (n=3 biological replicates) treated as indicated for1or 24h. Data are 


normalized by the levels of PBS (control) and shown as mean +s.d. Asterisks 
indicate P< 0.05 between the indicated treatment and treatment with PBS 


(two-sided Student's t-test). LTa1f2, lymphotoxin «1/B2; ODN, CpG 
oligodeoxynucleotides; OSM, oncostatin M.c, Immunoblot analysis of HCT116 
cells that express empty vector, Myc-tagged wild-type (WT) ZC3H12A or the 
indicated ZC3H12A mutants after stimulation with IL-1B for the indicated times, 
using anti-Myc and B-actin antibodies. d, Immunoblot analysis of ZC3H12A, 
Flag-tagged dominant-negative BTRC(AF) and B-actin in HeLa cells that express 
doxycycline-inducible BTRC(AF) after stimulation with IL-17A or IL-1B for the 
indicated times with or without doxycycline (Dox). Data are representative of 
two independent experiments. For gel source data, see Supplementary Fig. 1. 
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Extended Data Fig. 7 | Clonal expansion in UC rectums. a, Expansion of clones 
within the rectum from the remaining four out of the six analysed patients with 
UC (two are presented in Fig. 2d). Allof the representations follow those in 

Fig. 2d. Driver mutations including those found in the two samples in Fig. 2d are 
summarized in Supplementary Table 9. b, Area occupied by each clone and 
total number of mutations estimated by PyClone for each clone detected in the 
UC rectums. Colours indicate the highest pathological grade within each clone. 
c-h, Phylogenetic trees of representative clones (6 out of 41) that were found in 


two or more rectum samples surgically obtained from six patients with UC. 
Imputed clonal structures are depicted in violin plots at the bottom, in which 
the estimated cellular prevalence of imputed mutational clusters (x axis) are 
shownas the distribution of posterior probabilities (width of the violin plots) 
calculated according to a PyClone model. Colours correspond to those inthe 
associated branches of trees. The estimated time of UC onset is indicated on 
the basis of the mutation rate in non-UC crypts (Fig. 1a). Branches that contain 
driver mutations are also indicated. 
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Extended Data Fig. 8 | Effect of Nfkbiz on clonal expansion in a mouse model 
of chronic DSS-induced colitis. a, Experimental schedule for induction of 
colitis in R26'°2"Nfkbiz”™ Vill-cre® (control) and R26°°2"Nfkbiz“"'Vill-cre™® 
(Nfkbiz cKO) mice. Cre-mediated recombination was induced by 
administration of tamoxifen (TAM), followed by induction of colitis by 
treatment with DSS. b, e, Images of whole-mount X-gal staining of the large 
intestine from control (top) and Nfkbiz cKO (bottom) mice after tamoxifen 
administration without (b) or with (e) subsequent treatment with DSS. 

c,f, Fraction of LacZ-positive areas in the middle to distal colon before (c) or 
after (f) induction of colitis with DSS according to genotypes. Two-sided 
Mann-Whitney Utest. d, Density of LacZ-positive areas (number per cm?) 
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having indicated sizes in control and Nfkbiz cKO mice before induction of 
colitis with DSS. g, Colon length (left) and histological score (right) after 
chronic DSS-induced colitis for the evaluation of severity of colitis. Two-sided 
Mann-Whitney Utest. h, Relative change in weight of NfkbizcKO and control 
mice during three courses of DSS-induced colitis. Data are shown as mean +s.d. 
i, Significant enrichment of NFKBIZ target genes (left) and genes associated 
with the IL-17 signalling pathway (right) among the downregulated genes in 
rectum epithelium from Nfkbiz cKO mice compared with control mice four 
days after the start of treatment with DSS. Data were generated fromn=3 


biologically independent mice for both genotypes. Two-sided nominal Pvalues 
were calculated by GSEA. 
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Extended Data Fig. 9 | See next page for caption. 
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Extended Data Fig. 9 | Mutational profiles and CNAs in control individuals 
and patients with UC. a, Landscape of driver mutations in CAC samples (n=99) 
and dysplastic UC samples (n= 13). Genes with asterisks are drivers in CACs 
(Methods). CNAs of known oncogenes are depicted only for the 34 CACs that 
were newly analysed in this study. b, Frequency of driver mutations among 
clones identified in non-dysplastic samples from patients with UC (n=183) 
versus SCRC samples from the Japanese cohort (n= 525). Asterisks indicate 

q< 0.1 (two-sided Fisher’s exact test with Benjamini-Hochberg adjustment). 

c, Number (middle) and relative frequency (bottom) of mutations allocated to 
signatures A and Binsamples of single crypts from control individuals (n= 43) 
and non-dysplastic (n= 241; 58 single crypts and 183 clones), dysplastic 

(n=13 clones) and non-hypermutated CAC (n=92 samples) from patients with 
UC. Information about disease state, pathology and sample type is indicated at 
the top. d, Comparison of the number of mutations detected by WESinclones 


identified innon-dysplastic samples, dysplastic samples and CAC samples 
without hypermutation from patients with UC. Two-sided Mann-Whitney U 
test.e, Mutational signatures of colorectal epithelium from control individuals 
and patients with UC. f, Fraction of signature B mutations in single crypts from 
control individuals, and clones identified in non-dysplastic (ND), dysplastic (D) 
and non-hypermutated CAC samples containing > 10 mutations from patients 
with UC. Two-sided Mann-Whitney Utest. g, Box plots of the fractions of 
genomic regions that contain CNAs in clones identified in non-dysplastic 
(n=183), dysplastic (n=13) and CAC (n=34) samples from patients with UC. 
Two-sided Mann-Whitney Utest. h, Bottom, colour-gradient maps of CNAs and 
UPDs as detected by WES data are shown for clones identified in non-dysplastic 
(n=183), dysplastic (n=13) and CAC (n=34) samples from patients with UC. 
Top, pathological grade, fractions of genomes that exhibit CNAs and UPDs and 
TP53 mutation status are also shown. Box plots as in Fig. 4. 


pL-CRISPR.EFS.GFP 


PCR primers 
~alp> rou) ePs> sncasop2ntecre— 


Cancer cell line 


206° Sort G6 


After 4 weeks 


@ 
O62 
@6 


GFP-positive cells 


se Culture @ 


@ 


‘Se? e%e° Ho? 
Control #1-4 ~S25=== Pome 
(non-targeting)  *S2 =. ee library DN le: sie 
=. © C6 o5 Infection | PCR J PCR 
————— O62 62 (MOI 0.2) s F s i 
sgNFKBIZ #1-6 Bs ch ree equencing equencing 
>_> = 
>_> 
Comparison 
Bb AOM 12mg/q i.p. @ 4.45 ~*~ Control (n = 25) 
as —e— Nfkbiz cKO (n= 25) 
a 
oO 
S 
> 
3 
a 
g 
© p=4.9x 10% = Control (n = 25) s 
26 6 ‘ = Nfkbiz cKO (n = 25) oc 
Bf © 
244 |! 0 @ ~ p=1.2* 10% 
. —" a After starting DSS (w) 
£2 ! 
2 HE B-catenin Ki-67 


a: a. f 
cea a a 


aa a i 


Control 
d p=0.17 p=041 
9 : é 
= @ 
5 Ce Seo § gs 
= 9 28 a ° 
DB °@ 3 ° 
c 7 °o@ 2 
2 oo 8 4 j 
5 op 3 og oo Nfkbiz cKO 
fo} 9 0 2 ® § OG 
6) = a) ood 
5 0 
Control Nfkbiz cKO Control Nfkbiz cKO 
(n= 25) (n = 25) (n=11) (n=11) 


Extended Data Fig. 10 | Effect of NFKBIZ knockout in CRC celllines and an 
AOM-DSS-induced colorectal tumour model. a, Experimental strategy for 
enrichment analysis using CRISPR-Cas9 (Methods). MOI, multiplicity of 
infection. b, Experimental schedule for induction of CAC in Nfkbiz“Vill-cre 
(Nfkbiz cKO) and Nfkbiz™ (control) mice using AOM followed by three 

cycles of DSS. c, Box plots of tumour numbers relative to tumour size 
(maximum diameter). Two-sided Mann-Whitney Utest. d, Colon length (left) 
and histological score (right) for the evaluation of inflammation in the AOM- 
DSS-induced tumour model. No significant difference was observed between 


NfkbizcKO and control mice (two-sided Mann-Whitney Utest). e, Relative 
change in weight of mice during three courses of treatment with DSS in 

the AOM-DSS-induced tumour model. Data are shown as mean +s.d. 

f, Histological images of haematoxylin and eosin (H&E) staining (left), and 
immunohistochemistry for B-catenin (middle) and Ki-67 (right). Tumours in 
NfkbizcKO and control mice were indistinguishable in terms of histology and 
immunochemistry. Representative images are shown fromn=3 biologically 
independent samples for both genotypes. Box plots as in Fig. 4. 
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Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


[| A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


Oo For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


[ ] Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection We enrolled a total of 647 subjects who underwent surgery or endoscopy of the lower gastrointestinal tract at Kyoto University Hospital, 
Hyogo College of Medicine Hospital, and Cancer Institute Hospital. Among these, 76 had been diagnosed as ulcerative colitis and the 
remaining 571 were diagnosed as other disease such as colorectal polyp or cancer without colitis, or were turned out to be healthy 
individuals. This study was approved by the ethical committees at Kyoto University, Hyogo College of Medicine, and Cancer Institute 
Hospital. Informed consent was obtained from all participants. As for external data sets, whole exome sequencing (WES) data of paired 
tumor/germline control samples from CRC (n=356) patients were downloaded from the TCGA Data Portal (https:// 


portal.gdc.cancer.gov/). Four previously published mutation lists of WES obtained from 65 colitis-accociated cancer in UC individuals 
were also analysed. 


Data analysis Detailed descriptions of the software and analysis have been provided in Online Methods. Sequencing data was processed using our in- 
house pipeline Genomonz. External bam files were converted to fastq format using biobambam2. Copy number analysis using 
sequencing data was performed by in-house pipeline CNACS. Significantly mutated genes were identified using dNdScv and MutSigCVv. 
utational signature was evaluated using pmsignature and deconstructSigs. Clonal architecture was estimated by PyClone (0.13.0). 
Phylogenetic analysis was performed by using MEGA X (v10.0.5). Transcriptome analysis was performed by STAR (v2.5.3.), featureCounts 
(v1.5.3), edgeR (v3.24.3), and GSEA (v2.2.4). Statistical analyses were performed using R (3.5.1). Images were analyzed by using ImageJ 
(1.52a). 


List of programs and softwares: 

Genomonz2 pipeline: version 2.6.0 (https://genomon.readthedocs.io/ja/latest) 

- Burrows-Wheeler Aligner: version 0.7.10 (https://sourceforge.net/projects/bio-bwa/) 

- picard-tools: version 1.39 (http://picard.sourceforge.net/) 

- biobambam2: version 2.0.85 (https://www.sanger.ac.uk/science/tools/biobambam) 

- GenomonMutationFilter: (https://github.com/Genomon-Project/GenomonMutationFilter) 

- EBCall: version 2 (https://github.com/friend1ws/EBCall) 

Integrative Genomics Viewer (IGV): version 2.4.6 (http://software.broadinstitute.org/software/igv/) 
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MutSigCVv: version 1.4. (http://software. broadinstitute.org/cancer/software/genepattern/modules/docs/MutSigCV) 

dNdScv: version 0.0.1 (https://github.com/im3sanger/dndscv) 

CNACS: (http://plaza.umin.ac.jp/kyoto_tumorpatho/CNACS.html) 

pmsignature: version 0.3.0 (https://github.com/friend1ws/pmsignature) 

deconstructSigs: version 1.8.0 (https://github.com/raerose01/deconstructSigs) 

PyClone: version 0.13.0 (https://github.com/aroth85/pyclone) 

MEGA X: version 10.0.5 (https://www.megasoftware.net/home) 
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For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 
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Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


All the WES and RNA-seq data have been deposited in the European Genome-Phenome Archive (http://www.ebi.ac.uk/ega/) under accession numbers 
EGAS00001003801 and EGASO0001003802, respectively. 
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Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size No statistical methods were used to determine sample size since this is an exploratory study. We enrolled as many patients as possible who 
provided consent for our study during the enrollment period between 1/Oct/2014 and 31/Mar/2019. Cryo-preserved colorectal cancer 
samples were provided from Cancer Institute Hospital. In total, we collected 1568 samples from 76 UC patients and 571 healthy or non-UC 
individuals. The sample size of our study was sufficient to confirm novel findings with statistical significance. As for external data sets, 356 
sporadic CRCs (sCRCs) and 65 colitis-associated cancers were also analysed. 


Data exclusions — For the analysis of TCGA sCRC samples, we excluded whole-genome amplified samples to avoid artefactual mutation callings. We also 
excluded hypermutated TCGA sCRCs from the analysis of comparison of driver genes between UC non-dysplasia and TCGA sCRCs because high 
background mutation rate of the tumours might make the difference difficult to detect. We also enrolled only non-hypermutated sCRCs from 
the Japanese cohort for the same reason. 


Replication Three or more biological replicates were generated for every assay, except for immunoblot experiments where two biological replicates were 
generated and the same results were confirmed among two replicates. We tested significant difference for every assay as described in the 
figure legends and Methods. 


Randomization — Not applicable since this is a case-series study which was therefore not planned to detect any difference in effects between the cohorts with 
and without intervention. 


Blinding Two expert pathologists, blinded to clinical information, reviewed all the specimens obtained from UC patients. Researchers were blinded to 
genotype information at measurement of each animal experiment. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


=) 
ied) 
ay 
S 
o 
= 
o 
Za) 
© 
jad) 
= 
(@) 
= 
= 
io 
70) 
fe) 
= 
=} 
a 
Wn 
= 
S| 
3 
fed) 
5 
< 


Materials & experimental systems Methods 


n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Antibodies 


Antibodies used Monoclonal anti-c-Myc (supplier name, Sigma-Aldrich; catalogue number, M4439; clone name, 9E10; lot number, 026M4825V; 
dilution, 1/2000) 
Anti-mouse IgG HRP (supplier name, GE Healthcare; catalogue number, NA9310-1ML; lot number, 16919626; dilution, 1/5000) 
Polyclonal anti-ZC3H12A (supplier name, Atlas Antibodies; catalogue number, HPAO32053; lot number, R32479; dilution, 1/500) 
Polyclonal anti-Flag (supplier name, Sigma-Aldrich; catalogue number, F7425; lot number, O78M4886V; dilution, 1/2000) 
Monoclonal anti-beta-Actin (supplier name, Santa Cruz; catalogue number, sc-47778 HRP; clone name, C4; lot number, K1418; 
dilution, 1/1000) 
Anti-rabbit IgG HRP (supplier name, GE healthcare; catalogue number, NA9340V; lot number, 16926347; dilution, 1/5000) 
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Validation Monoclonal anti-c-Myc (supplier name, Sigma-Aldrich; catalogue number, M4439) — immunoblotting — human cells; Validation: 
Manufacturer - https://www.sigmaaldrich.com/catalog/product/sigma/m4439 
Polyclonal anti-ZC3H12A (supplier name, Atlas Antibodies; catalogue number, HPAO32053) — immunoblotting — human cells; 
Validation: Manufacturer - https://www.atlasantibodies.com/products/antibodies/primary-antibodies/triple-a-polyclonals/ 
zc3h12a-antibody-hpa032053/ 
Polyclonal anti-Flag (supplier name, MERCK; catalogue number, F7425) — immunoblotting — human cells; Validation: 
Manufacturer - https://www.sigmaaldrich.com/catalog/product/sigma/f7425 
Monoclonal anti-beta-Actin (supplier name, Santa Cruz; catalogue number, sc-47778 HRP) — immunoblotting — human cells; 
Validation: Manufacturer - https://datasheets.scbt.com/sc-47778.pdf 


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) HT29, SW480, DLD1, HCT116, LoVo, and HeLa cells were obtained from ATCC. 
Authentication None of the cell lines were authenticated. 
Mycoplasma contamination All cell lines used for this study were tested negative for mycoplasma using Myco Alert Mycoplasma Detection Kit (Lonza). 


Commonly misidentified lines No commonly misidentified cell lines were used in this study. 
(See ICLAC register) 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals VilCre mice (004586) and R26-LacZ mice (003309) were obtained from The Jackson Laboratory. Nfkbiz flox mice (RBRC06410) 
were obtained from Riken BRC. VilCreER mice were kindly gifted by Sylvie Robine. All mice used in this study were C57BL/6 
strains. 

In the analysis of clonal expansion in chronic DSS colitis, we analysed 11 R26-LacZ/wt;Nfkbiz wt/wt;VilCreER (control) mice (5 
male and 6 female) and 10 R26-LacZ/wt;Nfkbiz fl/fl;VilCreER (Nfkbiz cKO) mice (5 male and 5 female) for the pre-colitis 
assessment. We also analysed 20 control mice (13 male and 7 female) and 21 Nfkbiz cKO mice (14 male and 7 female) for the 
post-colitis assessment. All mice were injected with tamoxifen at 6 weeks of age. 

In the transcriptome analysis of DSS-induced colitis, 3 male Nfkbiz fl/fl (control) mice and 3 male Nfkbiz fl/fl;VilCre (Nfkbiz cKO) 
mice were analysed at 8 weeks of age. 

In the AOM-DSS tumour model, we started experiments with 25 Nfkbiz fl/fl (control) mice (18 male and 7 female) and 25 Nfkbiz 
fl/fl;VilCre (Nfkbiz cKO) mice (18 male and 7 female) at 8 weeks of age. 


Wild animals This study did not involve wild animals 
Field-collected samples This study did not involve field-collected samples 
Ethics oversight Allin vivo experiments in animals were approved by Kyoto University Animal Care and Use Committee. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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Population characteristics Participants were recruited through at Kyoto University, Hyogo College of Medicine, and Cancer Institute Hospital. We enrolled a 
total of 647 patients who underwent surgery or endoscopy of the lower gastrointestinal tract at Kyoto University Hospital, Hyogo 
College of Medicine Hospital, and Cancer Institute Hospital. Among these, 76 had been diagnosed as ulcerative colitis (UC) and 
the remaining 571 (healthy or non-UC) were diagnosed as other disease such as colorectal polyp or cancer without associated 
colitis or turned out to be healthy individuals. The detailed characteristics of cases were provided in Supplementary Table 1. 


Recruitment Patients who underwent surgery or endoscopy of the lower gastrointestinal tract between 1/Oct/2014 and 31/Mar/2019 at 
Kyoto University and Hyogo College of Medicine were recruited without any selection. Cryo-preserved non-hypermutated sCRC 
samples between 1/Dec/2006 and 31/May/2016 were provided from Cancer Institute Hospital without any selection. 


Ethics oversight This study was approved by the ethical committees at Kyoto University, Hyogo College of Medicine, and Cancer Institute 
Hospital. Informed consent was obtained from all participants. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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Acute myeloid leukaemia (AML) is a heterogeneous disease characterized by 
transcriptional dysregulation that results in a block in differentiation and increased 
malignant self-renewal. Various epigenetic therapies aimed at reversing these 
hallmarks of AML have progressed into clinical trials, but most show only modest 
efficacy owing to an inability to effectively eradicate leukaemia stem cells (LSCs)'. 
Here, to specifically identify novel dependencies in LSCs, we screened a bespoke 
library of small hairpin RNAs that target chromatin regulators in a unique ex vivo 
mouse model of LSCs. We identify the MYST acetyltransferase HBO] (also known as 
KAT7 or MYST2) and several known members of the HBO1 protein complex as critical 
regulators of LSC maintenance. Using CRISPR domain screening and quantitative 
mass spectrometry, we identified the histone acetyltransferase domain of HBO1 as 
being essential in the acetylation of histone H3 at K14. H3 acetylated at K14 (H3K14ac) 
facilitates the processivity of RNA polymerase II to maintain the high expression of 
key genes (including Hoxa9 and Hoxa10) that help to sustain the functional properties 
of LSCs. To leverage this dependency therapeutically, we developed a highly potent 
small-molecule inhibitor of HBO1 and demonstrate its mode of activity asa 
competitive analogue of acetyl-CoA. Inhibition of HBO1 phenocopied our genetic 
data and showed efficacy in a broad range of human cell lines and primary AML cells 
from patients. These biological, structural and chemical insights into a therapeutic 
target in AML will enable the clinical translation of these findings. 


AMLis organized in a loose hierarchy, in which a small population of 
self-renewing LSCs give rise to alarge population of more-mature leu- 
kaemic blasts”. Although several human and mouse AML cell lines have 
undergone chemical and genetic screens to identify targetable depend- 
encies in this disease’ ®, the majority of these models do not replicate 
the functional properties of LSCs. Analogous to the effective mainte- 
nance of embryonic stem cells with therapeutic pressure to decrease 
differentiation®, amethod to sustain cells with the transcriptional and 


functional properties of LSCs in liquid culture has previously been 
established’. Importantly, we concurrently established an isogenic 
population of AML blasts. 

Because regulators of transcription are the most frequent muta- 
tional targets in AML, we performed a pooled negative-selection 
screen with a customized small hairpin RNA (shRNA) library against 
270 known chromatin modifiers to identify transcriptional regulators 
that are required for the maintenance of functionally validated LSCs’. 
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Fig. 1| HBO1is an essential dependency inLSCs. a, Pooled negative-selection 
screening in MLL-AF9 bulk blasts (left) and LSCs (right). Volcano plot depicts 
the changes in representation of shRNAs in the screen between day 2 and 

day 14. Each dot represents the mean of two independent experiments fora 
gene. b, Negative-selection competition assays, depicting the percentage over 
time of blasts and LSCs that express Hbol shRNAs. n=3 experiments. 

Mean +s.e.m. NT, non-targeting shRNA. c-e, Immunoblot of HBO1, H3K14ac 
and H3K27ac (n=3) (c), negative-selection competition assays (n=3; 

mean +s.e.m.) (d) and immunoblot of H3K14acin LSCs that express shRNAS 
that target members of the Hbol complex (n =3) (e). f, g, Negative-selection 
competition assays (n=3; mean+s.e.m.) (f) and immunoblot of HBOI, H3K14ac 
and H3K27ac (representative of n=3 biological replicates) (g) in LSCs that 
express Cas9 and sgRNAs against Rosa26 (control) or the Hbol catalytic 
domain. h, Schematic outline of HBO1 rescue experiments. Wild-type (WT) 
Hbol, catalytic-mutant Hbol®*°*? (resistant to sgRNA against Hbol e12.2) or GFP 
were overexpressed in LSCs that express Cas9, and then transduced with 
sgRNAs targeting Hbol.i, HBO1-rescue negative-selection competition assays 
(n=3;mean+s.e.m.). SgRNA against Hbol e1l1.1 targets endogenous (end) and 
overexpressed (OE) Hbo1. Right, immunoblot of overexpressed wild-type and 
catalytic-mutant HBO1. 


The screen was highly reproducible, and clearly identified shared and 
unique dependencies in LSCs and AML blasts (Extended Data Fig. 1a). 
We observed far fewer dependencies in LSCs: less than one-third of 
the shRNAs were depleted in the LSCs compared to the blasts (Fig. 1a, 
Supplementary Table 1). It has previously been shown that the BET bro- 
modomain proteins (BRD2, BRD3 and BRD4) were not a major depend- 
ency in this LSC model’, and—in addition—we found that most of the 
hitherto-identified epigenetic dependencies (including DOTIL, LSD1, 
EZH2 and PRMTS) that have been the focus of clinical therapies! selec- 
tively eradicate only the blasts and not the LSCs (Extended Data Fig. 1b, 
c). Of the few dependencies identified in the LSCs, we chose to focus on 
Hbo1 (also knownas Kat7) as it is not arecognized essential gene and was 
equally effective in eradicating the blast and LSC populations (Fig. 1a, 
b, Extended Data Fig. 1d). HBO1is one of five mammalian members of 
the highly conserved MYST family of acetyltransferases. Recent efforts 
to identify unique and global genetic dependencies in human cells 
have highlighted the fact that MOF (also known as KATS) and TIP60 
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Fig. 2|Loss of HBO1 impairs LSC maintenance in vivo. a, b, Cell-cycle profile 
(n=3;mean+s.e.m.) (a) and surface expression of GRland CD11b (n=3) (b)in 
LSCs that express Cas9, transduced with Hbol sgRNAs. c, Percentage of shRNA- 
positive cells in bone marrow (BM) and spleen (SP) at end point, in recipients 
transplanted with quinary MLL-AF9 cells. Mean+s.d.n=5 mice per group. 

d, Percentage of shRNA-positive cells (mean +s.e.m.) in bone marrowat end 
point, in recipients transplanted with tertiary NPMIc and FLT3-ITD cells.n=6 
mice per group. e, Kaplan-Meier curves of recipients injected with NPMI1c and 
FLT3-ITD cells that express shRNA targeting Hbol.n=5 mice per group. Inset, 
immunoblot of HBO1in NPMiIc and FLT3-ITD cells expressing Hbo1 shRNA from 
arecipient that died showed that the hairpin was inactivated. Hbol levels from 
parental NPMIc and FLT3-ITD cells are shown for comparison. f, Kaplan-Meier 
curves of C57BL/6 mice injected with 10? Hbol*#""* Mx1-cre primary MLL-AF9 
leukaemic cells. Mice were treated with saline or pIpC (arrowheads). Inset, 
representative genotyping of recipient bone marrowat end point. n=6 mice 
per group. g-k, Negative-selection competition assays (n=3; mean +s.e.m.) 
(g), immunoblot of HBO1and H3K14ac (n= 3) (h), cell-cycle profile (n=3; 

mean +s.e.m.) (i), apoptosis (n =3; mean +s.e.m.) (j) and surface expression of 
CD11b (n=3) (k) in Molm13 cells that express Cas9 and HBO1 sgRNAs. Safe3 
guide was used asa negative control. 


(also knownas KAT5) are pan-essential genes’, whereas HBO1 is highly 
expressed in human AML (Extended Data Fig. 2), in which it shows a 
clear and unique dependency’ (Fig. 1b, c, Extended Data Fig. 3a, b). 
HBO1 has previously been reported to function as a major transcrip- 
tional regulator, primarily via histone acetylation, and—although vari- 
ous histone modifications have been attributed to HBO1° *—these 
conflicting reports are probably influenced by the specificity of the 
antibodies used. Therefore, to precisely identify the major histone 
modifications regulated by HBOI, we coupled conditional deletion 
of HBO1in AML cells with quantitative mass spectrometry”. These 
data clearly demonstrate that the acetylation of histone H3 at K14 is 
the major non-redundant chromatin modification that is mediated by 
HBO1 (Fig. Ic, Supplementary Table 2). 

Similar to most histone acetyltransferases, HBO1 can interact with 
several scaffolding proteins to form functionally distinct catalytically 
active complexes". Therefore, to identify the major complex members 
required for the maintenance of LSCs, we assessed the functional effect 
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Fig.3|HBO1regulates expression of the 5’ end of the HOXA cluster in AML. 
a, Heat map depicting normalized expression of the top-25 downregulated 
genes in LSCs with HBO1 loss. b, Heat map displaying the mean spectral counts 
from affinity-purification mass spectrometry data of seven MLL1 translocation 
partners for all members of the MYST family. c, Network analysis of known 
protein interaction partners of HBO1 (thick grey lines), linking to known 
members of the HBO1 complex and HBO1 network to seven MLL fusions (thin 
red line). Node size represents the mean spectral counts for each interaction. 
The width of the node border is scaled to the number of interactions with the 
MLL fusions. d, ChIP-seq profiles of H3K14ac and RNA Pol-II at the HOXA gene 
cluster in LSCs that express Cas9 and Hbol sgRNA. Representative of 


of genetic depletion of the members of the HBO1 complex. We found 
that knockdown of BRPF2 (also known as BRD1), MEAF6 and PHF16 (also 
known asJADE3) phenocopied the functional and biochemical effects 
of HBO1 reduction (Fig. 1d, e, Extended Data Fig. 3c). As all of the com- 
ponents of the HBO1 complex that resulted in impaired maintenance of 
LSCs also caused a global reduction in H3K14ac, we reasoned that the 
catalytic domain of HBO1 might be the critical target. CRISPR domain 
screening with five separate single-guide RNAs (sgRNAs) against the 
MYST domain confirmed that this domain was essential for H3K14ac 
and LSC survival (Fig. 1f, g). Although CRISPR domain screening is an 
effective approach to identifying functional domains for drug discov- 
ery”, our rescue experiments with wild-type and catalytically inactive 
HBO1 provided the highest level of confidence that the histone acetyl- 
transferase domain of HBOI was the critical therapeutic target in the 
complex (Fig. 1h, i). 

The dominant cellular phenotypes that resulted from the loss of HBO1 
included an induction of apoptosis, a prominent GO/G1 cell-cycle arrest 
anda marked differentiation of the immature LSC population (Fig. 2a, 
b, Extended Data Fig. 3d-h). These data highlighted the importance of 
HBO1in LSC maintenance in an ex vivo model system. To address the 
broader application of our findings in the absence of therapeutic pres- 
sure to maintain the LSC state’, we generated an enriched population 
of LSCsinan in vivo mouse model” and performed a competition assay 
to assess the requirement of HBO1 for LSC maintenance in vivo. Here, 
we transplanted a fixed ratio of 90% shRNA-expressing cells and fol- 
lowed the percentage of shRNA-expressing cells that contributed to the 
leukaemia in vivo. Despite 90% of LSCs expressing Hbo1 shRNA being 
transplanted, less than 5% of them remain at the time of death from 


268 | Nature | Vol577 | 9 January 2020 


n=3 biological replicates. e, RNA Pol-Il coverage across highly expressed genes 
(high) divided according to H3K14ac levels. TSS, transcription start site. 

f, Waterfall plots of change in RNA Pol-II (fold change, expressed inlog;, (LFC) 
of RNA Pol-II) binding throughout the gene promoter (-30 to +300 bp) and 
gene body (+300 bp totranscription end site (TES)) following HBO1 loss 
(knockout (KO)) for highly expressed genes, divided according to H3K14ac 
levels. g,h, Negative-selection competition assays (n =3; mean+s.e.m.) and 
Hoxa9 and Hoxa10 mRNA expression (n=3; mean+s.e.m.) (h) in LSCs that 
express Cas9 and Smarca4, Dpf2 or SmarcaS sgRNAs. i, Surface expression of 
GR1land CD11b in LSCs that express Cas9 and Dpf2sgRNAs.n=2. 


leukaemia—demonstrating a marked negative selection (Fig. 2c). By 
contrast, both the non-targeting shRNA and shRNA against the closely 
related member of the MYST family Moz (also known as Kat6a) show no 
detrimental effect to LSCs. Similar results were also seen in the NPM1c 
and FLT3-ITD mouse model” (Fig. 2d). Moreover, the mice transplanted 
with shRNAs against Hbol showed a substantial survival benefit (Fig. 2e, 
Extended Data Fig. 4a), raising the prospect that Hbo1-null LSCs are 
incapable of perpetuating the disease. 

To explore this possibility further, we generated leukaemias using 
an MX1-Cre model for the conditional deletion of Hbo1 (Extended 
Data Fig. 4b). The resulting leukaemia was then transplanted into sec- 
ondary recipient mice and polyinosinic:-polycytidylic acid (plpC) was 
administered after engraftment. Neither pIpC injection nor heterozy- 
gous deletion of Hbo1 affect survival or leukaemia latency (Extended 
Data Fig. 4c). Leukaemic cells derived from Hbol”"« Mx1-cre mice 
show a marked survival advantage, and none of the fatal leukaemia 
that occurred in pIpC-treated mice showed complete loss of HBO1 
(Fig. 2f). By contrast, homozygous deletion of Moz showed no effects 
on survival in two separate mouse models of AML (Extended Data 
Fig. 4d-f). Together, these data confirm the results from our ex vivo 
model, and provide compelling evidence that HBO1 is an essential 
requirement for LSC maintenance. 

To assess the generality of our findings beyond mouse models of 
AML, we chose to delete HBO1 using CRISPR-Cas9 in a range of human 
AMLcelllines that encompasses a variety of oncogenic drivers prevalent 
in AML®. We found the majority of AML cell lines recapitulate our results 
inthe mouse LSCs, and show an impaired survival of HBO1-deleted cells 
that results from an induction of apoptosis, a GO/G1 cell-cycle arrest 
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Fig. 4 | Treatment with WM-3835 reduces AML growth. a, Chemical structure 
of WM-3835. b, Selectivity and potency profile of WM-3835 and WM-1119, 
superimposed onthe KAT family dendrogram, as measured by a histone 
acetyltransferase inhibition assay. Values are listed in Extended Data Fig. 8a. 

c, Ribbon representation of the HBO1-BPRF2 crystal structure (cyan) with WM- 
3835 bound (yellow with element colouring; RCSB Protein Data Bank code 
(PDB) 6MAJ) overlaid with the corresponding region of MYST“ with WM-1119 
bound (magenta with element colouring; PDB 6BA4), in which the non- 
conserved residues of MYST“’*" are shown in (blue). The key residues 
highlighted showa hydrogen bond with the phenol of WM-3835. d, Immunoblot 
of H3K14ac in LSCs treated with WM-3835. n=2.e, LSCs and human AML cell 


and prominent differentiation (Fig. 2g—k, Extended Data Fig. 5). By con- 
trast, very fewnon-AML cell lines showa similar dependency on HBO1 
(Extended Data Fig. 6). Having established the requirement of HBO1in 
mouse and human models of AML, we next wanted to understand the 
molecular events that underpin the role of HBO1in LSC maintenance. 
Consistent with the major cellular phenotype of myeloid differentia- 
tion, we found that HBO1 loss results in the marked enrichment of a 
myeloid differentiation gene-expression program (Extended Data 
Fig. 7a). The established role of HBO1 as a facilitator of transcription 
led us to examine the top downregulated genes after HBOI deletion. 
These downregulated genes are some of the most highly expressed 
(Extended Data Fig. 7b) and include several homeobox genes (Fig. 3a), 
which are known to be important in LSC maintenance and are com- 
monly upregulated in AML with a poor prognosis”. The requirement 
of HBO1to sustain the expression of the essential LSC genes within the 
5’-HOXA gene cluster is conserved in human AML cells (Extended Data 
Fig. 7c), and the dominant role of these genes in mediating the cellular 
phenotypes of HBO1 loss is highlighted by the fact that overexpression 
of Hoxa9 or Hoxal0 considerably rescues the myeloid differentiation 
and loss of viability that is observed after depletion of HBO1 (Extended 
Data Fig. 7d-f). 


proliferation with 1 1M WM-3835.n=3.Mean+s.e.m. f. Proliferation of LSCs 
with Moz or Qkf (also known as Kat6b and Morf) deletion with WM-3835.n=3. 
Mean +s.e.m. Right, immunoblot of H3K23ac and H3K14ac in LSCs with Moz or 
Qkf deletion. n=2.g,h, Cell-cycle profile (n=3; mean +s.e.m.) (g) and surface 
expression of CD11b (n =3) (h) in Molm13 cells with WM-3835. i. Hoxa9 
expression in mouse LSCs and HOXA9 expression in Molm13 cells treated with 
WM-3835.n=2.j, Clonogenic assays with primary AML cells from patients, 
treated with WM-3835.n=5.Mean+s.e.m. Gene symbols in parentheses 
denote mutated myeloid genes. k, Schematic overview of AML gene regulation 
by the HBO1 complex. 


Many of the genes that are downregulated after loss of HBO1— 
particularly the homeobox genes~are established targets of both wild- 
type MLL1and MLL1 fusion proteins”. Using quantitative proteomics 
in an isogenic leukaemia cell line designed to express a single copy 
of seven distinct MLLI fusion proteins”, we identified members of 
the HBO1 complex that are functionally required to maintain LSCs 
(Fig. 1d) as astronginteractors with the N terminus of MLL1 (Fig. 3b, c). 
Although these findings provide molecular insights into how the HBO1 
complex is recruited to specific gene loci, to further understand the 
role of HBO1in regulating these genes we performed chromatin immu- 
noprecipitation with sequencing (ChIP-seq) analyses for H3K14ac 
and RNA polymerase II (RNA Pol-II). These data show that H3K14ac 
deposited by HBO1 is widespread throughout the genome, but that 
at the highly expressed genes repressed by loss of HBO1, H3K14ac 
and RNA Pol-II blanket the entire coding region of the gene (Fig. 3d, 
Extended Data Fig. 7g). H3K14ac is an evolutionarily conserved histone 
modification, and recent evidence suggests that H3K14ac may regu- 
late transcriptional elongation”. Consistent with this, we find mark- 
edly increased levels of RNA Pol-II within the coding region of highly 
expressed genes that contain the highest levels of H3K14ac (Fig. 3e). 
Furthermore, expressed genes with the highest level of H3K14ac have 
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the lowest RNA Pol-II travelling ratio, and loss of HBO1 leads to a more 
prominent loss of RNA Pol-II within the body of these genes (Fig. 3f, 
Extended Data Fig. 7h). The processivity of RNA Pol-Ilis greatly facili- 
tated by chromatin remodelling complexes, and H3K14ac has previ- 
ously been shown to be specifically bound by SMARCA4”, DPF2” and 
members of the ISWI family”, resulting in the marked potentiation of 
their remodelling activity. The members of these chromatin remodel- 
ling complexes show a similar cancer-cell-line dependency profile to 
HBO1 (witha predilection for AML’) and also phenocopy the effects of 
HBO1 loss in LSCs (Fig. 3g-i, Extended Data Fig. 7i). 

Our genetic data in both mouse and human AML cells clearly identi- 
fied the catalytic activity of HBO1 as the central therapeutic target. A 
long-standing challenge in the field has been to develop highly selective 
small-molecule histone-acetyltransferase inhibitors that discriminate 
between the major families of histone acetyltransferases. It has recently 
been demonstrated that the acylsulfonylhydrazide backbone provides 
asimple chemical scaffold for the generation of selective inhibitors 
of MYST-family acetyltransferases*. Using this template, we gener- 
ated WM-3835 (N’-(4-fluoro-5-methyl-[1,1’-biphenyl]-3-carbonyl)-3- 
hydroxybenzenesulfonohydrazide) (Fig. 4a), which retains specificity 
for the MYST acetyltransferases but has increased potency against 
HBO1 compared to WM-1119 (Fig. 4b, Extended Data Fig. 8a). We solved 
the crystal structure of HBO1 with WM-3835 bound in the acetyl-CoA 
binding site at 2.14 A (Fig. 4c, Extended Data Fig. 9). Overlaying this 
crystal structure with WM-1119 in complex witha modified MYST acetyl- 
transferase domain® (MYST°*"S') shows that WM-3835 makes additional 
interactions with the protein surface, which may explain the increased 
activity of WM-3835 against HBO1. Specifically, the WM-3835 phenol 
forms a hydrogen-bonding network with GluS25 and Lys488, neither 
of which is conserved throughout the MYST family. 

WM-3835 is a cell-permeable small molecule that results in a rapid 
and selective reduction in levels of H3K14ac (Fig. 4d). Treatment of a 
diverse set of AML cell lines with WM-3835 resulted in a marked reduc- 
tion in tumour-cell viability (Fig. 4e, Extended Data Fig. 10a) that was 
not observed after treatment with the inactive analogue WM-2474” 
(Extended Data Fig. 8b). Notably, we observed an excellent dose- 
response relationship between a reduction of levels of H3K14ac and 
cell viability (Extended Data Fig. 8c, d). Although WM-3835 retains 
potency against MOZ and QKF (also known as MORF or KAT6B), 
CRISPR-CAS9-mediated deletion of these enzymes does not alter the 
activity of WM-3835 (Fig. 4f), which highlights the fact that the efficacy 
of WM-3835 in AML is primarily via HBO1 inhibition. Moreover, treat- 
ment of cells with WM-3835 phenocopied the molecular and cellular 
effects of genetic depletion of HBO1 by inducing apoptosis, aGO/G1 
cell-cycle arrest, differentiation of human AML cells and transcriptional 
repression on HOXA9 and HOXA1O0 (Fig. 4g, i, Extended Data Fig. 8e-h). 
Similar to our genetic studies, overexpression of Hoxa9 and HoxalO 
ameliorated the effects of WM-3835 (Extended Data Fig. 8i). Although 
the rapid metabolism—including glucuronidation of WM-3835— 
precluded efficacy experiments in vivo (Extended Data Fig. 10b-c), the 
compound showed a prominent reduction of clonogenic potential in 
primary human AML cells (derived from several patients) that contained 
different driver mutations, highlighting the therapeutic potential of 
catalytic inhibitors against HBO1in AML (Fig. 4j). 

Central to the ambition to alter the natural history of AML is the 
requirement for new therapies that effectively target LSCs from the out- 
set. LSCs serve as the reservoir for evolving resistance to conventional 
and targeted therapies, and our clinical experience has clearly proven 
that monotherapies are incapable to subvert the vast adaptive poten- 
tial of LSCs. Therefore, the future lies in identifying key therapeutic 
targets in LSCs that can be leveraged in combination with other effective 
agents, including conventional chemotherapy. Here we identify HBO1 
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as atargetable dependency in LSCs. Our molecular insights suggest that 
MLLI recruits HBOI1 to regulate highly expressed LSC genes (including 
the HOXA cluster) through H3K14ac, which potentiates the activity 
of specific chromatin remodelling complexes and enables a greater 
processivity of RNA Pol-II (Fig. 4k). The blueprint for selective and 
potent inhibition of HBO1, together with these biological insights, 
provide the impetus and platform for the translation of these findings 
into the clinical setting. 


Online content 


Any methods, additional references, Nature Research reporting sum- 
maries, source data, extended data, supplementary information, 
acknowledgements, peer review information; details of author con- 
tributions and competing interests; and statements of data and code 
availability are available at https://doi.org/10.1038/s41586-019-1835-6. 


1. Dawson, M. A. The cancer epigenome: concepts, challenges, and therapeutic 
opportunities. Science 355, 1147-1152 (2017). 

2. Thomas, D. & Majeti, R. Biology and relevance of human acute myeloid leukemia stem 
cells. Blood 129, 1577-1585 (2017). 

3. Wang, T. et al. Gene essentiality profiling reveals gene networks and synthetic lethal 
interactions with oncogenic Ras. Cell 168, 890-903 (2017). 

4.  Tzelepis, K. et al. A CRISPR dropout screen identifies genetic vulnerabilities and 
therapeutic targets in acute myeloid leukemia. Cell Reports 17, 1193-1205 (2016). 

5. Zuber, J. et al. RNAi screen identifies Brd4 as a therapeutic target in acute myeloid 

eukaemia. Nature 478, 524-528 (2011). 

6. Ying, Q. L. etal. The ground state of embryonic stem cell self-renewal. Nature 453, 

519-523 (2008). 

7. Fong, C. Y. et al. BET inhibitor resistance emerges from leukaemia stem cells. Nature 525, 

538-542 (2015). 

8. Grimwade, D., Ivey, A. & Huntly, B. J. Molecular landscape of acute myeloid leukemia in 

younger adults and its clinical relevance. Blood 127, 29-41 (2016). 

9. Tsherniak, A. et al. Defining a cancer dependency map. Cell 170, 564-576 (2017). 

10. ueh, A. J., Dixon, M. P., Voss, A. K. & Thomas, T. HBO1 is required for H3K14 acetylation 

and normal transcriptional activity during embryonic development. Mol. Cell. Biol. 31, 

845-860 (2011). 

11. Doyon, Y. et al. ING tumor suppressor proteins are critical regulators of chromatin 
acetylation required for genome expression and perpetuation. Mol. Cell 21, 51-64 
(2006). 

12. Sauer, T. et al. MYST2 acetyltransferase expression and histone H4 Lysine acetylation are 
suppressed in AML. Exp. Hematol. 43, 794-802 (2015). 

13. Weinert, BT. et al. Time-resolved analysis reveals rapid dynamics and broad scope of the 

CBP/p300 acetylome. Cell 174, 231-244 (2018). 

4. Saksouk, N. et al. HBO1 HAT complexes target chromatin throughout gene coding 
regions via multiple PHD finger interactions with histone HS tail. Mol. Cell 33, 257-265 
(2009). 

5. Shi, J. et al. Discovery of cancer drug targets by CRISPR-Cas9 screening of protein 
domains. Nat. Biotechnol. 33, 661-667 (2015). 

6. Krivtsov, A. V. et al. Transformation from committed progenitor to leukaemia stem cell 
initiated by MLL-AF9. Nature 442, 818-822 (2006). 

7. Mupo, A. et al. A powerful molecular synergy between mutant nucleophosmin and FIt3- 
ITD drives acute myeloid leukemia in mice. Leukemia 27, 1917-1920 (2013). 

8. Alharbi, R. A., Pettengell, R., Pandha, H. S. & Morgan, R. The role of HOX genes in normal 
hematopoiesis and acute leukemia. Leukemia 27, 1000-1008 (2013). 

9. Ballabio, E. & Milne, T. A. Epigenetic control of gene expression in leukemogenesis: 
cooperation between wild type MLL and MLL fusion proteins. Mol. Cell. Oncol. 1, 
e955330 (2014). 

20. Skucha, A. et al. MLL-fusion-driven leukemia requires SETD2 to safeguard genomic 

integrity. Nat. Commun. 9, 1983 (2018). 

21. Church, M., Smith, K. C., Alhussain, M. M., Pennings, S. & Fleming, A. B. Sas3 and 
Ada2(Gcn5)-dependent histone H3 acetylation is required for transcription elongation at 
the de-repressed FLO7 gene. Nucleic Acids Res. 45, 4413-4430 (2017). 

22. Shen, W. et al. Solution structure of human Brg] bromodomain and its specific binding to 
acetylated histone tails. Biochemistry 46, 2100-2110 (2007). 

23. Huber, F. M. et al. Histone-binding of DPF2 mediates its repressive role in myeloid 
differentiation. Proc. Natl Acad. Sci. USA 114, 6016-6021 (2017). 

24. Dann, G.P. et al. ISWI chromatin remodellers sense nucleosome modifications to 
determine substrate preference. Nature 548, 607-611 (2017). 

25. Baell, J. B. et al. Inhibitors of histone acetyltransferases KAT6A/B induce senescence and 
arrest tumour growth. Nature 560, 253-257 (2018). 


Publisher's note Springer Nature remains neutral with regard to jurisdictional claims in 
published maps and institutional affiliations. 


© The Author(s), under exclusive licence to Springer Nature Limited 2019 


Methods 


No statistical methods were used to predetermine sample size. The 
experiments were not randomized and investigators were not blinded 
to allocation during experiments and outcome assessment. 


Cell culture 

MLL-AF9 bulk blasts and LSCs were generated as previously described’. 
Mouse and human cell lines (NOMO-1, Molm13, MV4;11, HL-60, OCI-M2, 
OCI-AML3, K562, NB4, SKM-1and KG-1) were maintained in RPMI-1640 
supplemented with 10% FCS, 2 mM GlutaMAX, 100 IU mI” penicillin, 
100 pg mI" streptomycin under standard culture conditions (5% CO,, 
37 °C). Blasts and LSCs were maintained in the presence of 0.1% DMSO 
or 1pMI-BET151, respectively, and IL-3 (10 ng ml’). HEK293T cells 
were maintained in DMEM supplemented with 10% FCS, 100 IU mI™ 
penicillin, 100 pg ml streptomycin in 10% CO, at 37 °C. All cell lines 
were regularly tested and verified to be mycoplasma-negative by PCR 
analysis by in-house genotyping. Human cell lines were authenticated 
by short tandem repeat profiling through the Australian Genome 
Research Facility. 


Virus production and transduction 

Retrovirus was produced by triple transfection of HEK293T cells with 
a retroviral LMP-blue fluorescent protein (BFP) transfer vector and 
structural pMD1-gag-pol plasmid and Vsv-g envelope plasmid at a 
0.75:0.22:0.03 ratio, as previously described”*. Lentivirus was produced 
by triple transfection of HEK293T cells witha lentiviral transfer vector, 
and the packaging plasmids psPAX2 and Vsv-g at a 0.5:0.35:15 ratio. 
All transfections were performed using polyethylenimine (PEI). Viral 
supernatants were collected 48 h after transfection, filtered through 
a0.45-um filter and added to target cells. 

To examine the effect of loss of MOZ function on progression of 
leukaemia, fetal liver cells were isolated from embryos with a germline 
deletion of Moz or littermate controls”’ at embryonic day (E)13. E13 
embryos were used because Moz-null embryos die by E14. Fetal liver 
cells (C57B/6; CD45.2 cell-surface phenotype) were transfected with 
MSCV expressing either MLL-AFP and GFP or MEIS1, HOXA9 and GFP or 
control viruses: empty vector (GFP) or MEISI-GFP or HOXA9-GFP alone 
(not shown); and prepared for infection as in the previous paragraph, 
except that the ECO envelope protein was used. After overnight culture, 
infected fetal liver cells were injected into C57B/6 CD45.1 recipient mice, 
which had been irradiated with a single dose of 700 rads. 


Pooled negative-selection RNA interference screening 

Acustom shRNA library targeting 270 mouse epigenetic enzymatic 
genes was designed using the Designer of Small Interfering RNA website 
(http://biodev.cea.fr/DSIR/DSIR.html) and subcloned into the LMP- 
BFP vector with selectable markers eBFP or puromycin, as previously 
described”’. After sequence verification, 1,922 shRNAs (6-8 per gene) 
were combined with several positive- and negative-control shRNAs 
at equal concentration in one pool. This pool was used to produce 
retrovirus, which was then transduced into 4 x 10° MLL-AF9 bulk blasts 
and LSCs at a multiplicity of infection of 0.3 and selected with 3 and 
5 ug ml‘ puromycin, respectively, commencing 30 h after transduc- 
tion. Throughout 14 days of puromycin selection, more than 20 mil- 
lion cells were maintained at each passage to preserve 10,000-fold 
library representation. Genomic DNA from day 2 to day 14 was isolated 
(DNeasy Blood & Tissue Kit, Qiagen) from both blasts and LSCs. shRNA 
sequences were amplified by PCR with primers containing adaptors for 
Illumina sequencing, as previously described”. The resulting librar- 
ies were sequenced with single-end 50-bp reads on a HiSeq2500. The 
shRNA sequences were mapped to the shRNAs within the pool, and the 
shRNA counts were analysed as previously described”. The likelihood 
ratio test was used to determine the hairpins that were significantly 
depleted over the time course of the experiment. Genes with at least 


two hairpins depleted by greater than tenfold were considered to be 
significant dependencies. 


CRISPR-Cas9-mediated gene disruption 

sgRNA oligonucleotides (Sigma-Aldrich) were phosphorylated, 
annealed and cloned into lentiviral expression vectors, pKLV- 
U6gRNA(BbslI)-PGKpuro2ABFP (Addgene 50946, deposited by K. Yusa). 
Cells were first transduced with the FUCas9Cherry (Addgene 70182, 
deposited by M. Herold) and sorted by fluorescence-activated cell sort- 
ing for high mCherry expression, and then subsequently transduced 
with the pKLV sgRNA expression vector. 


shRNA and sgRNA competitive proliferation assay 

Bulk blasts and LSCs were transduced with retrovirus expressing a 
gene-specific shRNA, the percentage of BFP-positive cells was measured 
between day 1 and day 13 after transduction, and normalized to the 
percentage of BFP positive cells at day 1. For sgRNA, Cas9-expressing 
cells were transduced witha lentivirus expressing an sgRNA targeting 
Hboland the percentage of double-positive BFP and mCherry cells was 
measured between day 2 and day 14 after transduction, and normalized 
to the percentage of BFP*mCherry’ cells at day 2 or day 4. AllshRNA and 
sgRNA sequences are provided in Supplementary Table 3. 


Antibodies 

Antibodies used for immunoblotting were rabbit anti-acetyl-histone 
H3 (Lys14) (D4B9, Cell Signalling Technology), mouse anti-histone 
H3K14ac (13HH3-1AS, Active Motif), rabbit anti-histone H3 (acetyl K27) 
(ab4729, Abcam), rabbit anti-KAT7/HBO1/MYST2 (ab70183, Abcam), 
rabbit anti-histone H3 (ab1791, Abcam), mouse anti-HSP60 (C10, Santa 
Cruz), rabbit anti-HSP60 (H-300, Santa Cruz), mouse anti-Flag (M2, 
Sigma), mouse anti-RNA polymerase II (CTD4H8, Millipore). Antibod- 
ies used for flow cytometry were Alexa Fluor 700 anti-GR1 (108422, 
BioLegend) and Brilliant Violet 605 anti-CD11b (101237, Biolegend) and 
APC/Cy7 anti-mouse CD117 (also known as KIT) (313228, Biolegend). 


Flow cytometry 

For cell apoptosis, shRNA- or sgRNA-positive cells were washed once 
with PBS andassessed using FITC-conjugated annexin V (640906, Bio- 
legend) and 4’,6-diamidino-2-phenylindole (DAPI) (D9542, Sigma) 
staining, according to the manufacturer’s instructions. For cell-cycle 
analysis, sSaRNA- or sgRNA-positive cells were washed with PBS and 
fixed for at least 2h at -20°C in 70% ethanol. Fixed cells were washed 
with PBS and incubated at 4 °Cin DAPI staining solution (1mg mI DAPI, 
0.05% (v/v) Triton X-100 in PBS) for 30 min. For surface expression of 
myeloid markers, shRNA- or sgRNA-positive cells were washed in PBS, 
and stained for GR1 or CD11b on ice for 30 min in PBS plus 5% FCS. All 
flow cytometry analyses were performed on aLSR Fortessa X-20 flow 
cytometer (BD Biosciences) and all data were analysed with FlowJo. 
Cell sorting was performed ona FACSAria Fusion 5 (BD Biosciences). 


Immunoblotting 

shRNA- or sgRNA-positive cells were lysed in 20 mM HEPES pH7.9, 0.5 
mM EDTA, 2% SDS plus 1x protease inhibitor cocktail (Roche) by brief 
sonication. Lysates were heated to 95 °C in SDS sample buffer with 50 
mM DTT for 5 min, separated by SDS-PAGE and transferred to PVDF 
membrane (Millipore). Membranes were blocked in 5% milk in TBS 
+0.1% Tween-20, probed with the indicated antibodies, and reactive 
bands were visualized using ECL Prime (GE). 


Analysis of HBO1-regulated acetylation of core histones 

Mouse MLL-AF9 cre*” Hbo&“ conditional knockout cells were labelled 
using stable isotope labelling by amino acids in cell culture, with 
ight’ (?C,,“N,-arginine and ’C,,“4N,-lysine) and ‘heavy’ ((°C,,°N,- 
arginine and °C,,°N,-lysine, Cambridge Isotope Laboratories) amino 
acids. To delete HBOI, the heavy-labelled cells were treated with 
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4-hydroxytamoxifen (200 nM) for about 40 h, and light-labelled con- 
trol cells were treated with vehicle control. Histones were extracted as 
previously described”. In brief, cells were lysed mechanically in ice-cold 
hypotonic lysis buffer (10 mM Tris pH 8.0, 1 mMMKCI, 1.5 mM MgCl, 1mM 
DTT and 1x complete protease inhibitor cocktail (Roche)) and intact 
nuclei were collected by centrifugation. Histones were acid-extracted 
with H,SO, (0.4 N) and precipitated with TCA (33% final concentra- 
tion). Purified histones from knockout and control cells were mixed 
in equal amounts, separated on SDS-PAGE and in-gel-digested with 
trypsin or LysC. (Sigma). The histone peptides were analysed by online 
nanoflow liquid-chromatography-coupled tandem mass spectrometry, 
using a Proxeon easy nLC system connected to a Q-Exactive HFX mass 
spectrometer (Thermo Scientific). The raw data were computationally 
processed using MaxQuant” (version 1.5.6.5) and searched against the 
UniProt database (downloaded 23 January 2014) using the integrated 
Andromeda search engine (http://www.maxquant.org/). The data were 
searched with three missed cleavages, a minimum peptide length of 
six amino acids and the requantify option selected, and acetylation 
was included as a variable modification. 


Rescue assays 

cDNA of Hbo1 and Hoxa10 were PCR-amplified from the cDNA library 
of mouse MLL-AF9 cells with primers containing a Flag. Hoxa9 cDNA 
was amplified from pTRE rtTA Flag Hoxa9 GFP. The catalytic mutant 
HBO1(E508Q) was generated by site-directed mutagenesis. Wild-type 
and mutant Hbol were made resistant to Hbol e12.2 sgRNA by silent 
point mutation of the protospacer-adjacent motif site corresponding 
to this sgRNA, using site-directed mutagenesis. All cDNAs were cloned 
into the lentiviral pHRSIN-P.,-y-GFP-P,,,-Puro vector**. LSCs expressing 
Cas9 were transduced with expression vectors and selected with 5 pg 
ml“ puromycin for 1 week. Overexpression lines were then subsequently 
transduced with Hbo1 sgRNA. 


Mouse details 

All mouse work was performed at the Peter MacCallum Cancer Centre 
animal facility, under approval E530 from the Peter MacCallum Cancer 
Centre animal ethics committee and at the Walter and Eliza Hall Insti- 
tute of Medical Research with approval from the Walter and Eliza Hall 
Institute Animal Ethics Committee under approval 2015.015. MX1-Cre 
Hbol"’* mice’ and Moz” mice were as previously described””. 


In vivo competition assay 

Quinary MLL-AF9 cells were transduced with non-targeting shRNA, 
Hbo1 shRNA or Moz shRNA at 90% transduction efficiency. One hun- 
dred thousand cells were transplanted 48 h after transduction into 
8-week-old female NOD/SCID///2rg-null (NSG) mice. BFP-positive 
shRNA-positive cells were determined by flow cytometry. 


Leukaemia maintenance 

The generation of Mx1-cre HboF conditional knockout mice has pre- 
viously been described”. KIT-positive cells from whole bone marrow 
were selected through magnetic bead selection (Miltenyi Biotec), and 
retrovirally transduced with the MSCV-MLL-AF9-IRES-YFP construct. 
Cells were transplanted in sublethally irradiated 6-8-week-old female 
C57BL/6 recipient mice. One hundred thousand leukaemic cells from 
the bone marrow were collected, and subsequently transplanted into 
sublethally irradiated 11-week-old female C57BL/6 recipient mice. Mice 
were randomized and pIpC (GE) was intraperintoneally administered 
6,10 and 14 days after transplantation, at 7.5 mg/kg. Amplification of 
wild-type and floxed alleles of leukaemic cells from bone marrow has 
previously been described”. 


RNA sequencing and analysis 
RNA from sgRNA-positive cells was prepared using the Qiagen 
RNeasy kit. RNA concentration was quantified with a NanoDrop 


spectrophotometer (Thermo Scientific). Libraries were prepared 
using QuantaSeq 3’ mRNA Library Prep kit (Lexogen). Libraries 
were sequenced on a NextSeq500 with 75-bp single-end reads. All 
RNA sequencing experiments were performed in triplicate. Following 
trimming of poly-A tails with cutadapt® (v.1.14), reads were aligned to 
the mouse genome (ensembl_GRC38.78) using hisat2*, and assigned 
to genes using htseq-count™. Differential gene expression analysis 
was performed using the edgeR®’ package in R (http://www.R-project. 
org/),and adjusted Pvalues were calculated using the Benjamini-Hoch- 
berg method”. Genes with fold changes (expressed as log) below -1 
and adjusted P values below 0.05 were considered to be significantly 
downregulated genes. Count data were voom-transformed using the 
voom function before performing gene-set testing with the mroast 
function*®, both from the limma package“. 


ChIP-seq and analysis 

Ten to twenty million sgRNA-positive cells were crosslinked with 1% 
formaldehyde for 10 min at room temperature, and crosslinking was 
quenched by addition of 0.125 M glycine. Cells were then lysed in 1% 
SDS, 10 mM EDTA, 50 mM Tris-HCI pH 8.0 and protease inhibitors. 
Lysates were sonicated in a Covaris ultrasonicator to achieve a mean 
DNA fragment size of 500 bp. Immunoprecipitation with anti-H3K14ac 
(Cell Signalling Technolgies) or anti-RNA polymerase II (Millipore) was 
performed overnight at 4 °C in modified RIPA buffer (10 mM Tris-HCI pH 
8.0,90 mM NaCl, 1% Triton X-100 and 0.1% deoxycholate). Protein A or 
G magnetic beads (Life Technologies) were used to bind the antibody 
and associated chromatin. Reverse crosslinking of DNA was followed 
by DNA purification using the QIAquick PCR purification kit (Qiagen). 
Sequencing libraries were prepared from eluted DNA using ThruPLEX 
DNA-seq kit (Rubicon). Libraries were size-selected between 200-500 
bp and sequenced on a NextSeq500 with 75-bp single-end reads. Fol- 
lowing the removal of IIlumina adaptors using cutadapt®, reads were 
aligned to a joint reference genome of mouse (ensembl_GRCm38.78) 
and Drosophila (ensembl_BDGP5.78) using bwa-mem (v. 0.7.13). SAM 
files were converted to BAM files using samtools” (v. 1.4.1). A scaling 
factor was calculated using the Drosophila spike-in, as previously 
described“. The scaling factor was used to normalize the coverage 
across the genome, when calculated using bamCoverage from deep- 
Tools** (v. 2.5.3) with bin sizes of 10 bp, and filtered with ENCODE project 
ChIP blacklist regions for mm10 (https://www.encodeproject.org/ 
annotations/ENCSR636HFF/). Genome-browser images were gener- 
ated from the conversion of BAM files to TDF using igvtools* (v.2.3.95). 
Heat map plots were generated using deepTools“ over the region 5 kb 
upstream to 5 kb downstream of the gene body of all genes. Coverage 
across the length of the gene body was scaled to 5 kb, and regions with 
no coverage were excluded from the plot. 


Quantitative real-time PCR 

RNA from sgRNA-positive cells 4-5 days after transduction, or cells 
treated with WM-3835 for 6-12 h, was extracted using the Qiagen RNAe- 
asy kit. cDNA was prepared using SuperScript VILO (Life Technologies) 
according to the manufacturer’s instructions. Quantitative real-time 
PCR was performed on an Applied Biosystems StepOnePlus using 
Fast SYBR green reagents (Thermo Scientific). Expression levels were 
determined using the AAC, method normalized to B2-microglobulin. 
All mRNA primer sequences are provided in Supplementary Table 3. 


Cell proliferation assays 

Cells were seeded at a constant density before treatment in triplicate, 
and treated with either 1 1M WM-3835, 1 uM WM-2474 or DMSO (0.1%) 
over the indicated time period. The drug was refreshed at least every 
two days. Cells were stained with DAPI and the live-cell number was 
calculated using the BD FACSVerse (BD Biosciences). To determine 
the half-maximal inhibitory concentration (IC,.) for the WM-3835, 4h 
after seeding the cells at a constant density in duplicate, the cells were 


treated with WM-3835, DMSO or positive control (3 1M puromycin) 
for 10 days. The drug and medium were refreshed at day 4 and day 7. 
At day 10, after incubating the cells with 600 uM of resazurin for 6h, 
fluorescence was measured at A,, = 530 nm and ,,, = 590 nm, using a 
Microplate Reader (EnSpire, Perkin Elmer). Relative fluorescence units 
were converted to per cent of inhibition relative to controls on the 
same plate, and the data were fitted against a four-parameter logistic 
model to determine the IC... 


Clonogenic assays in methylcellulose 

Clonogenic potential was assessed through colony growth of bone 
marrow cells from patients with AML plated in cytokine-supplemented 
methylcellulose (MethoCult H4434, StemCell Technologies). Bone mar- 
rowwas plated in duplicate at a cell dose of 2 x 10‘ cells per plate in the 
presence of vehicle (0.1% DMSO) or 1 14M WM-3835. Cells were incubated 
at 37 °C and 5% CO, for 12 days, at which time colonies were counted. 


Patient material 

Bone marrow containing >80% blasts was obtained from patients fol- 
lowing consent and under full ethical approval by the Peter MacCal- 
lum Cancer Centre Research Ethics Committee (reference number: 
HREC/17/PMCC/69). 


Lysine acetyltransferase biochemical assays 

KAT enzymes were either produced or purchased, as previously 
described”. Lysine acetyltransferase assays were run as previously 
described”, with two modification. First, 100 nM of full-length bioti- 
nylated histone H3 (for MOZ, QKF and HBO1) or histone H4 (for KATS 
and KATS8) proteins were used as the substrate, as indicated. Second, 
assays were run with 1M acetyl-CoA concentration, the approximate 
K,, for acetyl-CoA for these enzymes in this assay format. 


HBO1H3K14ac biomarker assay 

The cell line U2OS was seeded at a density of 3,000 cells per well in 
384-well optical-quality tissue-culture plates, in RPMI medium supple- 
mented with 10% fetal bovine serum and 10 mM HEPES. The cells were 
allowed to adhere for 24 h under standard culture conditions (37 °C and 
5% CO,). At the end of this period, the cells were washed with medium. 
Compound dilutions prepared in DMSO were added to the medium, 
with negative-control wells reserved for treatment with DMSO-only 
and 100%-inhibition positive controls at 10 pM concentration. After 
incubation for 24 h, the cells were fixed with 4% formaldehyde in PBS 
for 15 min at room temperature, washed with phosphate buffered saline 
and blocked with blocking buffer containing 0.2% TritonX100 and 2% 
BSA. anti-H3K14ac antibody (Cell Signaling Technologies) in blocking 
buffer was added and incubated overnight at 4 °C. After washing, a sec- 
ondary antibody labelled with AlexaFluor 488 dye (ThermoFisher) and 
Hoechst 33342 (1 pg/ml, Life Technologies) was added for 2hincubation 
at room temperature. Plates were washed and read on a PerkinElmer 
Opera HCS high-content imaging platform. Using a Columbus image 
analysis pipeline, individual nuclei were located by Hoechst 33342 
stain and the level of H3K14ac was calculated from the Alexa-Fluor- 
488-related intensity in the same area. The resulting mean intensity 
per cell was converted to per cent inhibition relative to controls onthe 
same plate and the data fitted against a four-parameter logistic model 
to determine the IC,o. 


HBO1-BPRF2 protein production, surface plasmon resonance 
and structural biology 

HBO1-BPRF2 protein was produced as previously described“. Sur- 
face plasmon resonance (SPR) for WM-3835 was done as previously 
described”. Crystals were grown at the CSIRO C3 crystallization centre 
in SD2 sitting-drop plates at 20 °C, with equal volumes of protein and 
crystallant (200 nl plus 200-nl drops) with the reservoir consisting of 
244 mM diammonium tartrate and 20% PEG 3350. Crystals started to 


form overnight and were collected 3 days later using 20% glycerol asa 
cryoprotectant. Data were obtained at the MX2 microfocus beamline 
at the Australian Synchrotron. The space group was found to be H3, and 
the data and refinement statistics can be found in Extended Data Fig. 6. 
The data were indexed with DIALS” (WM-3835) or XDS“8 (acetyl-CoA), 
scaled and integrated with Aimless”, the structure was solved with 
Phaser® using PDB 5GK9 as the initial model, manually refined with 
Coot™ and full refinement was done using Phenix.refine” (WM-3835) 
or REFMAC® (acetyl-CoA). Crystallization and refinement statistics 
are shown in Extended Data Fig. 10. 


In vitro metabolic stability 

The metabolic stability assay was performed by incubating each test 
compound in liver microsomes at 37 °C and a protein concentration 
of 0.4 mg/ml. The metabolic reaction was initiated by the addition of 
either single cofactor (NADPH only), or dual cofactors (NADPH and 
UDPGA), and quenched at various time points over a 60-min incuba- 
tion period by the addition of acetonitrile containing diazepam as 
an internal standard. Control samples (containing no NADPH) were 
included (and quenched at 2, 30 and 60 min) to monitor for potential 
degradation in the absence of cofactor. The human liver microsomes 
used in this experiment were supplied by XenoTech, lot no. 1410230. 
The mouse liver microsomes used in this experiment were supplied by 
XenoTech, lot no. 1510256. Microsomal incubations were performed 
at a substrate concentration of 11M. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


The shRNA screen sequencing data have been deposited to the NCBI 
Sequence Archieve under the accession number GSE120813. Crystal 
structure data for HBO1-BPRF2in complex with WM-3835 and acetyl- 
CoA have been submitted to the PDB under accession numbers 6MAJ 
(WM-3835) and 6MAK (acetyl-CoA). Source Data are provided for 
Figs. 1-4. Any other relevant data are available from the correspond- 
ing author upon reasonable request. 


Code availability 
All code used in this study is publically available. 
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Extended Data Fig. 1| HBO1is an AML-specific dependency. a, RNA 
interference screen multidimensional scaling plot of shRNA sequences from 
bulk leukaemic blasts (bulk) and LSCs over 14 days of screening. Screens were 


performed in duplicate (Rl and R2). b, Expression of Ezh2, Lsd1 or Prmt5 mRNA, 


relative to non-targeting shRNA inLSCs that express shRNAs.n=3. 


Mean +s.e.m.c, Negative-selection competition assays in bulk leukaemic 
blasts and LSCs that express shRNAs against Ezh2, Lsd1, Prmt5 or Dotil.n=3. 
Mean +s.e.m. Validation of the shRNAs against Dotil have previously been 
published™. d. Expression of Hbo1 mRNA, relative to non-targeting shRNA, in 
LSCs that express shRNAs. n=3.Mean+s.e.m. 
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Extended Data Fig. 2| Expression levels of HBO1. a. Box plot of HBO1 
expression levels in cancer types from The Cancer Genome Atlas (TCGA)*>*. The 
upper limit, centre and lower limit of each box denotes the upper quartile, 
median and lower quartile of the data, respectively. Cohort abbreviations refer 


to TCGA study abbreviations (https://gdc.cancer.gov/resources-tcga-users/ 
tcga-code-tables/tcga-study-abbreviations). b, Dot plot of levels of HBO1 
expression in normal haematopoietic and AML cells from BloodSpot”; 
horizontal line indicates mean expression. 
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Extended Data Fig. 3 | HBO1 depletionincreases apoptosis, cell-cycle arrest 
and myeloid differentiation in mouse LSCs. a, Negative-selection 
competition assays in LSCs that express shRNAs against Moz, Qkfor Hbol.n=3. 
Mean +s.e.m.b, Negative-selection competition assays in LSCs that express 
Cas9, transduced with sgRNAs targeting Moz or Qkf.n=3.Mean+s.e.m. 

c, Expression of members of the HBO1 complex, relative to non-targeting 
shRNA inLSCs that express shRNAs targeting individual members of the 


complex.n=3.Mean+s.e.m.d, Apoptosis of LSCs that express shRNAS 
targeting Hbol.n=3.Mean+s.e.m.e, Cell-cycle profile of LSCs that express 
shRNAs targeting Hbol.n=3.Meants.e.m.f, Surface expression of GRland 
CD11binLSCs that express shRNASs against Hbol.n=3.g.Apoptosis of LSCs that 
express Cas9 and sgRNAs targeting Hbol.n=3.Mean+s.e.m.h, Surface 
expression of KIT (also known as CD117) in LSCs that express sgRNAs targeting 
Hbo1. Representative of n=2 biological replicates. 


Article 


a MLL-AF9 b Wildtype allele (Hbo1*) 
an mas ——— 
= — shMoz 
S — shHbo1 Sp2 
® 50 Floxed allele (Hbo1/0x) 
e Ping loxP eetonr 
: —_yt_,_;_+— 
® 
ou em P3 
0 P2 
10 20 30 ; 
Time after transplantation (d) Grepelsted alele(Hner) loxP 
P1 —_ 
Cc MLL-AF9 Hbo7 fl/+ fe eee eee ee 
100 — Saline 
s — plpc ; 
$ Saline plpc 
Ss 
@ 69 500 bp 
ro 400 bp 
oa 
2 300 bp 
[:}) 
= Ma deleted 
0 == floxed 
200 b 
0 10 20 30 : 


Time after transplantation (d) 


d MLL-AF9 
100 eae Moz +/+ 
=- Moz +/- 
s ea Moz -/- 
=> 
7) 
€ 50 ooee, 
® : 
2 : 
@® e 
o : 
; : 
0 20 40 60 
Time after transplantation (d) 
e HoxA9/Meis1 
100 =— Moz +/+ 
g e# = Moz -/- 
2 
S 
7) 
¢ 50 , 
7) ;° 
© ; 
® Pecvcvcece 
Oo 


0 20 40 60 
Time after transplantation (d) 


Extended Data Fig. 4 | In vivo depletion of HBO1 increases disease latency. 

a, Kaplan-Meier curves of NSG mice transplanted with quinary MLL-AF9 
leukaemic cells that express shRNAs targeting Hbol or Moz.n=6 mice per 
group. b, Schematic of wild-type and Hbo1-mutant alleles with numbered black 
boxes representing exons. Genotyping primers are indicated (adapted from 
ref.!°).c, Kaplan-Meier curves of C57BL/6 mice injected with 10° Hbol* Mx1- 


Oe Wi\type 


cre primary MLL-AF9 leukaemic cells. Mice were treated with saline or pIpC. 
n=12 mice per group. Right, representative genotyping of recipient bone 
marrowat end point. d, Kaplan-Meier curves of C57BL/6 mice injected with 
Moz", Moz” or Moz’ MLL-AF9 leukaemic cells. n=5 mice per group. 

e, Kaplan-Meier curves of BALB/c mice injected with Moz or Moz’ HOXA9 
and MEIS1 leukaemic cells.n=5 mice per group. 
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Extended Data Fig. 5 | HBO1is adependency in various subtypes of AML. 

a, Negative-selection competition assays in human AML cell lines that express 
Cas9 and sgRNAs targeting HBO1.n=3.Mean+s.e.m. Driver mutations arein 
parentheses. b, Immunoblot of HBO1and H3K14ac in OCI-AML3 cells that 
express Cas9 and sgRNAs targeting HBOI.n=3.Mean+s.e.m.c, Cell-cycle 
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Extended Data Fig. 6 | HBO1 dependency in other cancers. Negative-selection competition assays in human cancer cell lines that express Cas9 and sgRNAs 
targeting HBO1.n=3.Mean+s.e.m. Cancer type isin parentheses. 
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Extended Data Fig. 7 | See next page for caption. 
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Extended Data Fig. 7 | HBO1 depletion increases myeloid signature and 
decreases global H3K14ac. a, Barcode plot evaluating changes in myeloid 
development signature after HBO1 depletion with Hbol ell.1and Hbo] e12.2 
sgRNAsinLSCs. n=3.b, Bar plot of changes in gene expression after Hbol 
deletion inLSCs, ranked by expression levels. Green bars show the top 25 most- 
downregulated genes after Hbol deletion. c, HOXA9 and HOXAIO mRNA 
expression in Molm13 and OCI-AML3 cells that express Cas9 and sgRNA 
targeting HBO1.n=3.Mean+s.e.m.d, Surface expression of CD11binLSCs that 
overexpress Hoxa9 or Hoxal0, and sgRNAs targeting Hbol.e, Immunoblot of 


overexpressed HOXA9 or HOXA10 in LSCs that express Cas9. Representative of 
n=3 biological replicates. f, Hoxa9- and Hoxal0-rescue negative-selection 
competition assays. Representative of n=3 biological replicates. g, ChIP-seq 
profiles of H3K14ac and RNA Pol-II at the Pbx3 locus in LSCs that express Cas9 
and ansgRNA targeting Hbol1. Representative of n=3 biological replicates. 

h, RNA Pol-II travelling ratio distribution for highly expressed genes, divided 
according to H3K14ac levels from ChIP-seq. i, Surface expression of GR1lin 
LSCs that overexpress sgRNAs targeting SmarcaS. Representative of 

n=3 biological replicates. 
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Extended Data Fig. 8 |WM-3835 inhibits cell growth and HOXA expression in 
AML.a, KAT biochemical and SPR values for WM-3835 compared to WM-1119. 
Biochemical assay was done at 1pM acetyl-CoA, the K,, of HBO1.b, Proliferation 
assays of human AML cells treated with 1 4M WM-2474. n=3.Mean+s.e.m. 

c, Cellular H3K14ac biomarker assay dose-response curves for WM-3835 (blue) 
and WM-1119 (red). n=6. Mean +s.e.m.d, Growth inhibition assays of the 
MLL-AF9 AML cell line Molm13 treated with WM-3835 at the doses indicated. 
Boxes represent minimum and maximum values. n=11.Mean+s.e.m. 


e, Cell-cycle profile of OCI-AML3 cells treated with WM-3835 or vehicle. n=3. 
Mean +s.e.m. f, Apoptosis of OCI-AML3 cells treated with WM-3835 or vehicle. 
n=3.Mean+s.e.m.g, Surface expression of CD11b in OCI-AML3 cells treated 
with WM-3835 or vehicle. n=3.h, HOXAIO mRNA expression in LSCs and 
Molm13 cells treated with WM-3835 or vehicle. n=3.Mean+s.e.m.i, HOXA9 and 
HOXA10 mRNAexpression in OCI-AML3 cells treated with WM-3835.n=3. 
Mean +s.e.m.j, Hoxa9 and Hoxal0 rescue proliferation assays with 1 14M WM- 
3835 in LSCs.n=3.Mean+s.e.m. 
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Extended Data Fig. 9 |HBO1 crystal structure. a, Datacollectionand 
refinement statistics of WM-3835 HBO1-BRPF2co-crystal structure. b, WM- 
3835 binding site in HBO1-BRPF2. WM-3835 is shown in silver with element 
colouring and the omit electron density map, contoured to 30, is shownin 
green.c, Overlay of WM-3835 and acetyl-CoA (purple with element colouring), 


showing that WM-3835 binds in the acetyl-CoA binding site of HBO1.d, Ribbon 
diagram of HBO1-BRPF2 showing WM-3835 bound to the acetyl-CoA binding 
site. e, Space-filling model showing WM-3835 (yellow with element colouring) 
inthe acetyl-CoA binding pocket of HBO1-BRPF2. 
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Extended Data Fig. 10 | See next page for caption. 
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Extended Data Fig. 10 | High in vitro metabolism and poor in vivo oral 
exposure of WM-3835. a, Proliferation assays of human AML cell lines treated 
with1 LM WM-3835. b, WM-3835 demonstrates high clearance in both human 
and mouse liver microsome assays. The use of dual cofactors (UDPGA and 
NADPH) results in an increased rate of clearance in human liver microsomes, 
whichis consistent with glucuronidation having a role in the clearance of this 
compound. c, BALB/c female mice were dosed with WM-3835 at 100 mgkg? 
taken by mouth twice a day, formulated in 20% PEG400/10% solutol or vehicle. 
Four hours after the third dose, blood samples were collected. An average total 
drug concentration of 1,860 nM was observed. The free drug level was 


determined to be 2.6 nM after accounting for mouse plasma protein binding 
(fraction unbound 0.0014). This free drug level was considered toolowto 
affect the acetylation of H3 at K14, on the basis of the in vitro H3K14ac cellular 
biomarker data (Fig. 4). An additional chromatographic peak eluting earlier 
than WM-3835 was detected in the plasma samples fromthe treatment group. 
Subsequent analyses using predicted multiple-reaction monitoring and 
accurate mass measurement indicated that this peak is likely tobea 
glucuronide conjugate of WM-3835, consistent with the in vitro metabolism 
data. 
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Next-Seq500 (Illumina); Quantative PCR data was collected using StepOnePlus Real-Time PCR system (Applied Biosystems) 
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performed using R or Prism 7 (GraphPad), MaxQuant was used to process LC-MS/MS data. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 


All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 
- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


The sequencing data that support the findings of this study has been deposited into the sequence read archive, which is hosted by the National Centre for 
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Data exclusions Mice were excluded from the Kaper-Meier curves if they had less than 20% detectable leukemic cells in the bone marrow or spleen at the 
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were able to be reliably reproduced. 


Randomization — Allocation of mice to different groups was random. 


Blinding Experimenters were not blinded to the experiment. However, all mouse experiments were performed by the core facility staff who have no 
direct involvement in the project and were blinded to the genotype of the mice or the shRNA that was used in the experiment. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Antibodies 


Antibodies used rabbit anti-acetyl-histone H3 (Lys 14) (D4B9, Cell Signalling Technology, Lot# 4) WB: 1:1000, ChIP-seq: 10 ug, mouse anti-histone 
H3K14Ac (13HH3-1A5, Active Motif, Lot# 10715003/11709001) WB: 1:1000, rabbit anti-histone H3 (acetyl K27) (ab4729, Abcam, 
Lot# GR3205526-1/GR3211959-1) WB: 1:5000, rabbit anti-KAT7/Hbo1/MYST2 (ab70183, Abcam, Lot# GR222365-1) WB: 1:1000, 
rabbit anti-histone H3 (ab1791, Abcam, Lot# GR234589-1) WB: 1:10000, mouse anti-HSP60 (C10, Santa Cruz, Lot# H1913) WB: 
1:10000, rabbit anti-HSP60 (H-300, Santa Cruz, Lot# GO119) WB: 1:10000, mouse anti-FLAG (M2, Sigma, Lot# SLBW3851) WB: 
1:2000, mouse anti-RNA polymerase II (CTD4H8, Millipore, Lot# 2958013) ChIP-seq: 10 ug. Gt-anti-Rb IgG (H+L) secondary 
antibody, HRP conjugate (65-6120, Invitrogen, Lot# TH275335), Goat anti-mouse lg, Human ads-HRP (1010-05, Southern 
Biotech, Lot# HO416-T867C).WB secondary antibodies (both anti-Rb and -Ms) were used at dilutions ranging 1:10000 to 
1:50000.Flow Cytometry: Alexa Fluor 700 anti-Gr1 (108422, BioLegend, Lot# B199432/B258102) and Brilliant Violet 605 anti- 
CD11b (101237, Biolegend, Lot# B266535), APC-Cy7 anti-mouse CD117 (c-kit) (313228, Biolegend, Lot# B163721), FITC-Annexin 
V (640906, Biolegend, Lot# 8311824). 


Validation All of the antibodies used have been extensively utilized in the literature and have been validated previously. 
H3K14ac and H3K23ac antibodies have been validated using the Epicypher panel. H3K14ac and Hbo1 antibodies were validated 
for WB in house using knockout cell lines (Cre lox and CRISPR-Cas9). H3K27ac and Total H3 for WB have and RNA Pol-I| for ChIP- 
seq have been validated and previously published by our laboratory (Gilan et al, NSMB, 2016). All flow cytometry antibodies have 
been previously validated by our laboratory (Fong et al., Nature, 2015). 
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Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) LSC and Blast cell lines were generated in house (Fong et al, Nature, 2015) 
Molm13, MV4;11, K562, HL-60, MCF7, T47D, MDA-MB-231, 2-138, HCC-827, NCI-H146, LnCaP, BxPC3 and A375 cell lines 
were obtained from ATCC 
NOMO-1 cells were obtained from JCRB 
OCI-AML3, OCI-M2, NB4, Skm1 and Kelly cell lines were obtained from DSMZ 
WM-852 were obtained from ESTDAB 


Authentication STR testing was performed to authenticate cell lines 
Mycoplasma contamination All cell lines used are mycoplasma negative as tested by the Victorian Infectious Diseases References Laboratory 


Commonly misidentified lines No commonly misidentified cell lines were used. 
(See ICLAC register) 
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Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals C57BL/6 female mice 
All mice were 6-8 weeks old 
Further details in Methods. 


Wild animals This study did not involve wild animals 
Field-collected samples No data in this study were field-collected 
Ethics oversight All scientific procedures using animals were carried out in accordance with the Prevention of Cruelty to Animals Act 1986 Act 


(the Act), associated Regulations and the Australian Code of Practice for the Care and Use of Animals for Scientific Purposes (the 
Code). Experiments were performed under the approved animal ethics experimentation committee permit number E615 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Human research participants 


Policy information about studies involving human research participants 


Population characteristics Human bone marrow samples from patients with acute myeloid leukaemia. Patient characteristics are representative of the 
general population. 


Recruitment Patients are recruited via a PICF to participate in the project titled Molecular Characterisation of Response, Resistance and Clonal 
Evolution in Haematological Malignancies. We do not foresee biases or any impact on results due patient this selection criteria. 


Ethics oversight The project has been reviewed and approved by the Peter MacCallum Cancer Centre Human Research Ethics Committte 
Reference number: HREC/17/PMCC/69 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


ChIP-seq 


Data deposition 


Confirm that both raw and final processed data have been deposited in a public database such as GEO. 


Confirm that you have deposited or provided access to graph files (e.g. BED files) for the called peaks. 


Data access links For "Initial submission" or "Revised version" documents, provide reviewer access links. For your "Final submission" document, 
May remain private before publication. provide a link to the deposited data. 

Files in database submission Provide a list of all files available in the database submission. 

Genome browser session Provide a link to an anonymized genome browser session for "Initial submission" and "Revised version" documents only, to 
(e.g. UCSC) enable peer review. Write "no longer applicable" for "Final submission" documents. 


Methodology 


Replicates Describe the experimental replicates, specifying number, type and replicate agreement. 


Sequencing depth 
Antibodies 

Peak calling parameters 
Data quality 


Software 


Flow Cytometry 


Describe the sequencing depth for each experiment, providing the total number of reads, uniquely mapped reads, length of 
reads and whether they were paired- or single-end. 


Describe the antibodies used for the ChiP-seq experiments; as applicable, provide supplier name, catalog number, clone 
name, and lot number. 


Specify the command line program and parameters used for read mapping and peak calling, including the ChIP, control and 
index files used. 


Describe the methods used to ensure data quality in full detail, including how many peaks are at FDR 5% and above 5-fold 
enrichment. 


Describe the software used to collect and analyze the ChiP-seq data. For custom code that has been deposited into a 
community repository, provide accession details. 


Plots 


Confirm that: 


Methodology 


Sample preparation 
Instrument 
Software 


Cell population abundance 


Gating strategy 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 
The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a 'group' is an analysis of identical markers). 
All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 


Cultured cell lines 
BD Bioscience BD FACSVERSE BD FACSFortessa 
BD Bioscience BD FACSuite software was used to collect and analyse data, and FlowJo 


Describe the abundance of the relevant cell populations within post-sort fractions, providing details on the purity of the samples 
and how it was determined. 


FSC-A and FSC-H was used to identify single cells, cells were separated by viability using DAPI or PI exclusion and (AnnexinV for 
apoptosis assay) 


[| Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 
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Bacteria use adaptive immune systems encoded by CRISPR and Cas genes to maintain 
genomic integrity when challenged by pathogens and mobile genetic elements! >. 


Type I CRISPR-Cas systems typically target foreign DNA for degradation via joint 
action of the ribonucleoprotein complex Cascade and the helicase-nuclease Cas3*°, 
but nuclease-deficient typeI systems lacking Cas3 have been repurposed for RNA- 
guided transposition by bacterial Tn7-like transposons®’”. How CRISPR- and 
transposon-associated machineries collaborate during DNA targeting and insertion 
remains unknown. Here we describe structures of a TniQ-Cascade complex encoded 
by the Vibrio cholerae Tn6677 transposon using cryo-electron microscopy, revealing 
the mechanistic basis of this functional coupling. The cryo-electron microscopy maps 
enabled de novo modelling and refinement of the transposition protein TniQ, which 
binds to the Cascade complex as a dimer in a head-to-tail configuration, at the 
interface formed by Cas6 and Cas7 near the 3’ end of the CRISPR RNA (crRNA). The 
natural Cas8—Cas5 fusion protein binds the 5’ crRNA handle and contacts the TniQ 
dimer viaa flexible insertion domain. A target DNA-bound structure reveals critical 
interactions necessary for protospacer-adjacent motif recognition and R-loop 
formation. This work lays the foundation for a structural understanding of how DNA 
targeting by TniQ-Cascade leads to downstream recruitment of additional 
transposase proteins, and will guide protein engineering efforts to leverage this 
system for programmable DNA insertions in genome-engineering applications. 


We previously demonstrated that a transposon derived from V. cholerae 
Tn6677 undergoes programmable transposition in Escherichia coli 
directed byacrRNA, and that this activity requires four transposon- and 
three CRISPR-associated genes in addition to a CRISPR array’ (Fig. 1a). 
Whereas TnsA, TnsB and TnsC exhibit functions that are consistent with 
their homologues from the related and well-studied cut-and-paste DNA 
transposon E. coli Tn7’, we showed that TniQ, a homologue of £. coli 
TnsD, forms a co-complex with the Cascade ribonucleoprotein com- 
plex encoded by the type I-F variant CRISPR-Cas system. This finding 
suggested an alternative role for TniQ, compared with the role of E. coli 
TnsD in identifying target sites during Tn7 transposition. We proposed 
that RNA-guided DNA targeting by Cascade could deliver TniQ to DNA 
ina manner compatible with downstream transpososome formation, 
and that TniQ might interact with Cascade near the 3’ end of thecrRNA, 
consistent with RNA-guided DNA insertion occurring around 49 bp down- 
stream of the protospacer-adjacent motif (PAM)-distal edge of the target 
site. To determine this unambiguously, we purified the V. cholerae TniQ- 
Cascade complex loaded with a native crRNA and determined its struc- 
ture by cryo-electron microscopy (cryo-EM) (Supplementary Table 1). 


Cryo-EM structure of TniQ-Cascade complex 

The overall complex adopts a helical architecture with protuberances 
at both ends (Fig. 1, Extended Data Figs. 1, 2). The global architecture is 
similar to previously determined structures of Cascade from I-E and I-F 


systems’ ” (Extended Data Fig. 3), with the exception of a large mass 
of additional density attributable to TniQ (see below). Maximum like- 
lihood classification methods implemented in Relion3” enabled us 
to identify marked dynamics in the entire complex, which appears 
to ‘breathe’, widening and narrowing the distance between the two 
protuberances (Extended Data Fig. 1d, Supplementary Video 1). The 
large subunit encoded by anatural Cas8-Cas5 fusion protein (hereafter 
referred to simply as Cas8) forms one protuberance and recognizes the 
5’ end of the crRNA via base- and backbone-specific contacts (Extended 
Data Figs. 4, 5a-—c, 6a), similar to the canonical roles of Cas8 and Cas5 
(Extended Data Fig. 3). Cas8 contains two primary subdomains formed 
mainly by a-helices and a third domain of approximately 100 residues 
(residues 277 to 385) that is predicted to form three a-helices but could 
not be built in our maps owing to its intrinsic flexibility (Fig. 1c). How- 
ever, low-pass-filtered maps revealed that this flexible domain connects 
with the TniQ protuberance at the opposite end of the crescent-shaped 
complex (Extended Data Fig. 2e). Additionally, there seemed to bea 
loose coupling between the Cas§8 flexible domain and overall breathing 
of the complex, as stronger density for that domain could be observed 
inthe closed state (Extended Data Fig. 1d, Supplementary Video 1). 
Six Cas7 subunits protect much of the crRNA by forming a helical 
filament along its length (Fig. 1b, d), similar to other type I Cascade 
complexes? ” (Extended Data Fig. 3). A ‘finger’ motif in Cas7 clamps 
the crRNA at regular intervals, causing every sixth nucleotide (nt) of 
the 32-nt spacer to flip out while leaving the flanking nucleotides 
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Fig. 1| Overall architecture of the V. cholerae TniQ-Cascade complex. 
a, Genetic architecture of the Tn6677 transposon (top), and plasmid constructs 
used to express and purify the TniQ-Cascade complex. Right, selected cryo-EM 
reference-free two-dimensional class averages in multiple orientations. 
b, Orthogonal views of the cryo-EM map of the TniQ-Cascade complex, 
showing Cas8 (purple), six Cas7 monomers (green), Cas6 (salmon), crRNA (grey) 


available for DNA recognition (Extended Data Figs. 4f, 6). These bases 
are pre-ordered in short helical segments, with a conserved pheny- 
lalanine stacking below the first base of every segment. Cas7.1, the 
monomer furthest away from Cas8, interacts with Cas6 (also known 
as Csy4), which is the RNase responsible for processing of the precur- 
sor RNA transcript derived from the CRISPR locus. The Cas6—Cas7.1 
interaction is mediated by a B-sheet formed by the contribution ofa 
B-strand from Cas6 and the two B-strands that form the finger of Cas7.1 
(Extended Data Fig. 5f). Cas6 also forms extensive interactions with 
the conserved stem-loop in the repeat-derived 3’ crRNA handle (Fig. 1, 
Extended Data Fig. 5d, e), with an arginine-rich a-helix (residues 110 to 
128) docked in the major groove, positioning multiple basic residues 
within interaction distance of the negatively charged RNA backbone. 

The interaction established between Cas6 and Cas7.1formsacontinu- 
ous surface on which TniQ is docked, forming the other protuberance 
of the crescent. The intrinsic flexibility ofthe complex resulted in lower 
local resolutions in this area of the maps, which we overcame using 
local alignments masking the area comprising TniQ, Cas6, Cas7.1 and 
the crRNA handle (Extended Data Fig. 7). The enhanced maps enabled 
de novo modelling and refinement of TniQ, for which no previous struc- 
ture or homology model has been reported, to our knowledge (Fig. 2). 
Notably, TniQ binds to Cascade as a dimer with head-to-tail configura- 
tion (Fig. 2),a surprising result given the expectation that F. coliTnsD 
functions as amonomer during Tn7 transposition”. 


TniQ binds to Cascade as a dimer 


TniQ is composed of two domains: an N-terminal domain of approxi- 
mately 100 residues formed by three short a-helices and asecond, larger 
domain of approximately 300 residues with a signature sequence for 
the TniQ family. A DALI search* using the refined TniQ model as a probe 
yielded marked structural similarity of the N-terminal domain to pro- 
teins containing helix-turn-helix (HTH) domains. This domain is often 
involvedin nucleic acid recognition; however, there are examples where 
ithas been re-purposed for protein-protein interactions’®. The remaining 
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Cryo-EM map 


and TniQ monomers (blue, yellow). The complex adopts a helical architecture 
with protuberances at both ends. c, A flexible domain in Cas8 comprising 
residues 277-385 (grey) could only be visualized in low-pass-filtered maps. The 
unsharpened map is shownas semi-transparent, grey map overlaid on the post- 
processed map segmented and coloured asina.d, Refined model for the TniQ- 
Cascade complex derived from the cryo-EM maps shown inb. 


C-terminal TniQ domain is formed by ten a-helices of variable length and 
is predicted to contain two tandem zinc finger motifs, although this region 
was poorly defined in the maps (Fig. 2). Overall, the double domain com- 
position of TniQ results in an elongated structure, bent at thejunction of 
the HTH and the TniQ domain (Fig. 2). The HTH domain of one monomer 
engages the TniQ domain of the other monomer viainteractions between 
a-helix 3 (H3) and a-helix 12 (H12), respectively, inatight protein-protein 
interaction (Fig. 2c). This reciprocal interaction is complemented by 
multiple interactions established between the TniQ domains from both 
monomers (up to 45 non-covalent interactions as reported by PISA”). 
Tethering of the TniQ dimer to Cascade is accomplished by specific 
interactions established with both Cas6 and Cas7.1 (Fig. 3). One monomer 
of TniQ interacts with Cas6 via its C-terminal TniQ domain, whereas the 
other TniQ monomer contacts Cas7.1throughits N-terminal HTH domain 
(Figs. 2b, 3). The loop connecting a-helices H7 and H8 of the TniQ domain 
of the first TniQ monomer is inserted in a hydrophobic cavity formed 
at the interface of two a-helices of Casé6 (Fig. 3b, d). The TniQ histidine 
residue 265 is involved in rearranging the hydrophobic loop connecting 
H7 and H8 (Fig. 3d), whichis inserted in the hydrophobic pocket of Cas6 
formed by residues L20, Y74, M78, Y83 and F84. The buried surface in the 
Cas6-TniQ.linteraction interface has an area of 420 A2. The HTH domain 
of the other TniQ monomer interacts with Cas7.1 through a network of 
interactions established mainly by a-helix H2 and the linker connecting H2 
and H3, burying a surface area of 595 A? (Fig. 3c, e). Thus, the HTH domain 
and the TniQ domain exert dual roles to drive TniQ dimerization and dock 
onto Cascade. The aggregate buried surface area for the TniQ-Cascade 
interactionis1,015 A2, significantly smaller than other Cascade-effector 
interactions suchas with the nuclease Cas3, in which 2,433 A’is buried". 
This difference is not surprising given the flexibility observed for the 
TniQ dimer inits association with Cascade (Supplementary Video 1). 


Structure of the DNA-bound TniQ-Cascade complex 


To investigate the structural determinants of DNA recognition by the 
TniQ-Cascade complex, we determined the structure of the complex 


Fig. 2| TniQ binds Cascade in a dimeric, head-to-tail configuration. a, Left, 
overall view of the TniQ-Cascade cryo-EM unsharpened map (grey) overlaid on 
the post-processed map segmented and coloured as in Fig. 1. Right, cryo-EM 
map (top) and refined model (bottom) of the TniQ dimer. The two monomers 
interact with each other ina head-to-tail configuration and are anchored to 
Cascade by Cas6 and Cas7.1. b, Secondary structure diagram of the TniQ dimer: 


bound to a double-stranded DNA (dsDNA) substrate containing the 
32-bp target sequence, 5’-CC-3’ PAM, and 20 bp of flanking dsDNA on 
both ends (Fig. 4, Extended Data Fig. 8). Density for 28 nucleotides of 
the target strand and 8 nucleotides of the non-target strand could be 
confidently assigned in the reconstructed maps (Fig. 4c). As with previ- 
ous I-F Cascade structures, Cas8 recognizes the double-stranded PAM 
within the minor groove" (Extended Data Fig. 9), and an arginine residue 
(R246) establishes a stacking interaction with a guanine nucleotide on 
the target strand, which acts as a wedge to separate the double-stranded 
PAM from the neighbouring unwound DNA where base-pairing with 
the crRNA begins (Fig. 4c). 

Twenty-two nucleotides of the target strand within the 32-bp target 
showed clear density, but surprisingly, the terminal nine nucleotides 


b TniQ domain 


HTH domain 


HTH domain TniQ domain 


c ese TniQ.1 


thirteen a-helices are organized into an N-terminal HTH domain anda 
C-terminal TniQ domain. Dimer interactions between H3 and H12 are indicated, 
as are interaction sites with Cas6 and Cas7.1.c, Cryo-EM density for the H3-H12 
interaction shows clear side-chain features (top), allowing accurate modelling 
of the interaction (bottom). d, Schematic of the dimer interaction, showing the 
important dimerization interface between the HTH and TniQ domain. 


were not ordered. The target-strand base pairs with the spacer region of 
the crRNA in short, discontinuous helical segments, as observed previ- 
ously for I-E and I-F DNA-bound Cascade complexes”, with every sixth 
base flipped out of the heteroduplex by the insertion of a Cas7 finger 
(Extended Data Fig. 6b). The observed 22-bp heteroduplex is stabilized 
by the four Cas7 monomers proximal to the PAM (Cas7.6-Cas7.3), but 
even after local masked refinements, no density could be observed for 
any target strand nucleotides that would base-pair with the 3’ end of the 
crRNA spacer bound by Cas7.2 and Cas7.1. These two Cas7 monomers are 
proximal to Cas6 and inthe region previously described to exhibit dynam- 
ics owing to the interaction of the Cas8 flexible domain with the inner 
face of the TniQ dimer. In addition, the disordered nucleotides also cor- 
respond to positions 25-28 of the target site where RNA-DNA mismatches 


Fig. 3 | Cas6 and Cas7.1 form a binding platform for TniQ. a, Top, magnified 
area showing the interaction site of Cascade and the TniQ dimer. Cas6 and Cas7.1 
are displayed as molecular Van der Waals surfaces, the crRNA is shownas grey 
spheres and the TniQ monomers are shownas ribbons. b, The loop connecting 


TniQ.1 a-helices H7 and H8 (blue) binds within a hydrophobic cavity of Cas6. 

c, Cas7.linteracts with the HTH domain of the TniQ.2 monomer (yellow), mainly 
through H2 and the loop connecting H2 and H3. d, e, Experimental cryo-EM 
densities observed for the TniQ-Casé6 (d) and TniQ-Cas7.1 (e) interactions. 
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Fig. 4|DNA-bound structure of the TniQ-Cascade complex. a, Schematic of 
crRNA and the portion of the dsDNA substrate that was experimentally 
observed within the electron density map for DNA-bound TniQ-Cascade. The 
target strand, non-target strand, PAM and seed regions are indicated (left); 
protein components are shown onthe right. b, Selected cryo-EM reference- 
free two-dimensional class averages for DNA-bound TniQ-Cascade; density 
corresponding to dsDNA could be directly observed protruding from the Cas8 
componentin the two-dimensional class averages (white arrows). c, Cryo-EM 
map for DNA-bound TniQ-Cascade. The crRNA is shown in dark grey and the 
DNA is shownin red. Right bottom, detailed views of the PAM and seed- 


are detrimental for RNA-guided DNA integration’. Thus, we propose 
that the partial R-loop structure that we observed could represent an 
intermediate conformation refractory to integration, and that further 
structural rearrangements may be critical for further stabilization of an 
open conformation, possibly driven by recruitment of the TnsC ATPase. 
Here we present cryo-EM structures of a CRISPR-Cas effector com- 
plex bound to the transposition protein TniQ, with and without target 
DNA. These structures reveal the unexpected presence of TniQ asa 
dimer that forms bipartite interactions with Cas6 and Cas7.1 within 
the Cascade complex, forming a probable recruitment platform for 
downstream-acting transposition proteins” (Fig. 4d). Our structures 
further reveal a possible fidelity checkpoint, whereby formation of a 
complete R-loop requires conformational rearrangements that may 
depend on extensive RNA-DNA complementarity and/or downstream 
factor recruitment; this proofreading step could account for the highly 
specific RNA-guided DNA integration that we previously reported for 
the V. cholerae transposon’. Inlight of recent work demonstrating exap- 
tation of type V-K CRISPR-Cas systems by similar Tn7-like transposons 
that also encode TniQ”,, it will be informative to determine whether 
tethering of TniQ to evolutionarily distinct crRNA effector complexes— 
Cascade or Cas12k—is a general theme of RNA-guided transposition. 


Online content 


Any methods, additional references, Nature Research reporting sum- 
maries, source data, extended data, supplementary information, 
acknowledgements, peer review information; details of author con- 
tributions and competing interests; and statements of data and code 
availability are available at https://doi.org/10.1038/s41586-019-1849-0. 
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Methods 


No statistical methods were used to predetermine sample size. The 
experiments were not randomized. The investigators were not blinded 
to allocation during experiments and outcome assessment. 


TniQ-Cascade purification 

Protein components of TniQ-Cascade were expressed from a pET- 
derivative vector containing the native V. cholerae tniQ-cas8-cas7- 
cas6 operon with an N-terminal His,,.-MBP-TEV site fusion on TniQ. 
The crRNA was expressed separately from a pACYC-derivative vector 
containing a minimal repeat-spacer-repeat CRISPR array encoding 
a spacer from the endogenous V. cholerae CRISPR array. The TniQ- 
Cascade complex was overexpressed and purified as described previ- 
ously’, and was stored in Cascade storage buffer (20 mM Tris-Cl, pH 7.5, 
200 mM NaCl, 1mM DTT, 5% glycerol). 


Sample preparation for electron microscopy 

For negative staining, 3 pl of purified TniQ-—Cascade ranging from 
100 nMto2 uM was incubated with plasma treated (H,/O, gas mix, Gatan 
Solarus) CF400 carbon-coated grids (EMS) for 1 min. Excess solution 
was blotted and 3 pl of 0.75% uranyl formate was added for an additional 
minute. Excess stain was blotted away and grids were air-dried over- 
night. Grid screening for both negative staining and cryo conditions was 
performed ona Tecnai-F20 microscope (FEI) operated at 200 KeV and 
equipped with a Gatan K2-Summit direct detector. Microscope opera- 
tion and data collection were carried out using the Leginon/Appion 
software. Initial negative staining grid screening allowed determina- 
tion of asuitable concentration range for cryo conditions. Several grid 
geometries were tested in the 1-4 uM concentration range for cryo 
conditions using a Vitrobot Mark-II operated at 4 °C, 100% humidity, 
blot force 3, drain time 0, waiting time 15s, and blotting times ranging 
from3-5Ss. The best ice distribution and particle density was obtained 
with 0.6/1 UltrAuFoil grids (Quantifoil). 


Electron microscopy 

A preliminary dataset of 300 images in cryo was collected with the 
Tecnai-F20 microscope using a pixel size of 1.22 A/pixel with illumina- 
tion conditions adjusted to 8 e /pixel/second with a frame window of 
200 ms. Preprocessing and image processing were integrally done 
in Relion3® with ctf estimation integrated via a wrapper to Gctf”. An 
initial model computed using the SGD algorithm” implemented in 
Relion3 was used as initial reference for a refine three-dimensional job 
that generated a sub-nanometric reconstruction with approximately 
10,000 selected particles. Clear secondary structure features in the 
two-dimensional averages and the three-dimensional reconstruction 
could be identified. 

For the DNA-bound TniQ-Cascade complex containing DNA, 
we pre-incubated two complementary 74-nt oligonucleotides: 
(NTS: 5’-TTCATCAAGCCAT TGGACCGCCT TACAGGACGCTTTGG 
CTTCATTGCTTTTCAGCT TCGCCT TGACGGCCAAAA-3’, TS: 5’-TTTTGG 
CCGTCAAGGCGAAGCTGAAAAGCAATGAAGCCAAAGCGTCCTGTAAGG 
CGGTCCAATGGCTTGATGAA-3’) for 5 min at 95 °C in hybridization 
buffer (20 mM Tris-Cl, pH 7.5, 100 mM KCI,5mM MgCl.) to form dsDNA, 
which was subsequently aliquoted and flash-frozen. Complex forma- 
tion was performed by incubating a 3x molar excess of dsDNA with 
TniQ-Cascade at 37 °C for 5 min before vitrification, which followed the 
conditions optimized for the apo complex (defined as TniQ-Cascade 
with crRNA but no DNA ligand). 

High-resolution data for the apo complex were collected in a Tecnai- 
Polara-F30 microscope operated at 300 KeV equipped witha K3 direct 
detector (Gatan). A 30-pm C2 aperture was used with a pixel size of 
0.95 A/pixel and illumination conditions in microprobe mode adjusted 
toa fluence of 16 e /pixel/second. Four-second images with a frame 
width of 100 ms (1.77 e-/A2/frame) were collected in counting mode. 


For the DNA-bound complex, high-resolution data were collected 
in a Titan Krios microscope (FEI) equipped with an energy filter 
(20 eV slit width) and a K2 direct detector (Gatan) operated at 300 
KeV. A 50-1m C2 aperture was used witha pixel size of 1.06 A/pixel and 
illumination conditions adjusted in nanoprobe mode to a fluence of 
8 e /pixel/second. Eight-second images with a frame width of 200 ms 
(1.42 e-/A2/frame) were collected in counting mode. 


Image processing 

Motion correction was performed for every micrograph applying the 
algorithm described for Motioncor2™ implemented in Relion3 with5 by 
5 patches for the K2 data and 7 by 5 patches for the K3 data. Parameters 
of the contrast transfer function for each motion-corrected micrograph 
were obtained using Gctf” integrated in Relion3. Initial particle pick- 
ing of a subset of 200 images randomly chosen was performed with 
the Laplacian tool of the Auto-picking module of Relion3, using an 
estimated size for the complex of 200 A. Then, 15,000 particles were 
extracted in a 300-pixel box size and binned 3 times for an initial two- 
dimensional classification job. Selected two-dimensional averages from 
this job were used as templates for Auto-picking of the full dataset. The 
full dataset of binned particles was subjected to a two-dimensional 
classification job to identify particles able to generate averages with 
clear secondary structure features. The selected subgroup of binned 
particles after the two-dimensional classification selection was refined 
against a three-dimensional volume obtained by SGD with the F20 data. 
This consensus volume was inspected to localize areas of heterogeneity 
that were clearly identified at both ends of the crescent shape char- 
acteristic of this complex. Both ends were then individually masked 
using soft masks of around 20 pixels that were subsequently used in 
classification jobs without alignments in Relion3. The T parameter 
used for this classification job was 6 and the total number of classes 
was 10. This strategy allowed us to identify two main population of 
particles which correspond to an open and closed state of the complex. 
Particles from both subgroups were separately re-extracted to obtain 
unbinned data sets for further refinement. New features implemented 
in Relion3, namely Bayesian polishing and ctf parameters refinement, 
allowed the extension of the resolution to 3.4, 3.5 and 2.9 A for the two 
apo and the DNA-bound complexes, respectively. Post-processing was 
performed with a soft-mask of 5 pixels being the B-factor estimated 
automatically in Relion3 following standard practice. A final set of 
local refinements was performed with the masks used for classifica- 
tion. The locally aligned maps exhibit very good quality for the ends of 
the C-shape. These maps were used for de novo modelling and initial 
model refinement. 


Model building and refinement 
For the Cas7 and Cas6 monomers, the £. colihomologues (PDB acces- 
sioncode4TVX) were initially docked with Chimera” and transformed 
to poly-alanine models. Substantial rearrangement of the finger region 
of Cas7 monomers, as well as other secondary structure elements of 
Cas6, were performed manually in COOT” before amino acid sub- 
stitution of the poly-alanine model. Well-defined bulky side chains 
of aromatic residues allowed a confident assignment of the register. 
The crRNA was also well defined in the maps and was traced de novo 
with COOT. For Cas8 and TniQ in particular, no structural similarity 
was found in the published structures that was able to explain our 
densities. Locally refined maps using soft masks at both ends of the 
crescent-shaped complex rendered well-defined maps below 3.5A 
resolution. These maps were used for manual de novo tracing of a 
poly-alanine model in COOT that was subsequently mutated to the 
V. cholerae sequences. Bulky side chains for aromatic residues showed 
excellent density and were used as landmarks to adjust the register of 
the sequence. 

For refinement, an initial step of real-space refinement against the 
cryo-EM maps was performed with the phenix.real_space refinement 
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tool of the Phenix package”, with secondary structure restraints acti- 
vated. A second step of reciprocal space refinement was performedin 
Refmac3”’, with secondary restraints calculated with Prosmart”’ and 
LibG*°. Weight of the geometry term versus the experimental term 
was adjusted to avoid overfitting of the model into cryo-EM map, as 
previously reported”. Model validation was performed in Molprobity™. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
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a F20-K2 b 
Negative Stain 


Extended Data Fig. 1| Cryo-EM sample optimization and image processing 
workflow. a, Representative negatively stained micrograph for 500 nM TniQ- 
Cascade. b, Left, representative cryo-EM image for 2 1M TniQ-Cascade. A small 
dataset of 200 images was collected in a Tecnai-F20 microscope equipped with 
aGatan K2 camera. Right, reference-free two-dimensional class averages for 
this initial cryo-EM dataset. c, Left, representative image froma large dataset 
collected ina Tecnai Polara microscope equipped with a Gatan K3 detector. 
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Middle, detailed two-dimensional class averages were obtained that were used 
for initial model generation using the SGD algorithm” implemented in 
Relion3" (right). d, Image processing workflow used to identify the two main 
classes of the TniQ-cascade complex in open and closed conformations. Local 
refinements with soft masks were used to improve the quality of the map within 
the terminal protuberances of the complex. These maps were instrumental for 
de novo modelling and initial model refinement. 
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Extended Data Fig. 2| Fourier shell correlation curves, local resolution, and final map. The overlap observed between the blue and red curves guarantees a 


unsharpened filter maps for the TniQ-Cascade complex in closed non-overfitted model. c, Unsharpened map coloured according to local 
conformation. a, Gold-standard Fourier shell correlation (FSC) curve using resolutions, as reported by RESMAP™. d, Final model coloured according to 
half maps; the global resolution estimation is 3.4 A by the FSC 0.143 criterion. B-factors calculated by REFMAC’*. e, A flexible Cas8 domain encompassing 
b, Cross-validation model-versus-map FSC. Blue curve, FSC between the residues 277-385 contacts the TniQ dimer at the other side of the crescent 


shacked model refined against half map 1; redcurve, FSC againsthalfmap2,not shape. Applying a Gaussian filter of increasing width to the unsharpened map 
included in the refinement; black curve, FSC between final model against the allows for a better visualization of this flexible region. 
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Extended Data Fig. 3 | Alignment of TniQ-Cascade with structurally similar 
Cascade complexes. The V. cholerae 1-F variant TniQ-Cascade complex (left) 
was superposed with Pseudomonas aeruginosa |-F Cascade" (also knownas Csy 
complex; middle, PDB ID: 6B45) and £. colil-E Cascade’ (right, PDBID: 4TVX). 


Shown arealignments of the entire complex (top), the Cas8 and Cas5 subunits 
with the 5’ crRNA handle (second from top), the Cas7 subunit with a fragment of 
crRNA (second from bottom) and the Cas6 subunit with the 3’ crRNA handle 
(bottom). 
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Extended Data Fig. 4| Representative cryo-EM densities for all the b-h, Final refined model inserted in the final cryo-EM density for select regions 
components of the TniQ-Cascade complex in closed conformation. a, Final of all the molecular components of the TniQ-Cascade complex. Residues are 


refined model of TniQ-Cascade, with Cas8 in purple, Cas7 monomers in green, numbered. 
Cas6 in salmon, the TniQ monomers in blue and yellow, and the crRNA in grey. 
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Extended Data Fig. 5| Cas8 and Cas6 interaction with the crRNA. a, Refined 
model for the TniQ-Cascade shownas ribbons inserted in the semi-transparent 
Van der Waals surface, coloured as in Fig. 1. b,c, Magnified view of Cas8, which 
interacts with the S’ end of the crRNA. The inset shows electron density for the 
highlighted region, where the base of nucleotide Clis stabilized by stacking 
interactions with arginine residues R584 and R424. d, Cas6 interacts with the 


3’ end of the crRNA ‘handle’ (nucleotides 45-60). e, Anarginine-rich a-helix is 
deeply inserted within the major groove of the terminal stem-loop. This 
interactionis mediated by electrostatic interactions between basic residues of 
Cas6 and the negatively charged phosphate backbone of the crRNA. f, Cas6 
(salmon) also interacts with Cas7.1 (green), establishing a B-sheet formed by 
B-strands contributed from both proteins. 
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Extended Data Fig. 6 | Schematic representation of crRNA and target DNA the complex are also shown, as well as the position of each Cas7 finger. b, TniQ- 
recognition by TniQ-Cascade. a, TniQ-Cascade residues that interact with Cascade residues that interact with crRNA and target DNA, shownasina. 
thecrRNA are indicated. Approximate location for all protein components of 
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Extended Data Fig. 7 | FSC curves, local resolution, and local refined maps 
for the TniQ-Cascade complex in open conformation. a, Gold-standard FSC 
curve using half maps; the global resolution estimation is 3.5 A by the FSC 0.143 
criterion. b, Cross-validation model-versus-map FSC. Blue curve, FSC between 
shacked model refined against half map 1; red curve, FSC against half map 2, not 
included in the refinement; black curve, FSC between final model against the 
final map. The overlapping between the blue and red curves guarantees anon- 
overfitted model. c, Unsharpened map coloured according to local 


Cryo-EM map 


Refined model 


After masking 


resolutions, as reported by RESMAP”. Right, slice through the map shownon 
the left. d, Local refinements with soft masks improved the maps in flexible 
regions. Shownis the region of the map corresponding to the TniQ dimer. 
Unsharpened maps coloured according to the local resolution estimations are 
shown before (left) and after (right) masked refinements. e, Final model for the 
TniQ dimer region, coloured according to the local B-factors calculated by 
REFMAC?S, 
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Extended Data Fig. 8 | FSC curves, local resolution, and unsharpened filter 
maps for the DNA-bound TniQ-Cascade complex complex. a, Gold-standard 
FSC curve using half maps; the global resolution estimation is 2.9 A by the FSC 
0.143 criterion. b, Cross-validation model-versus-map FSC. Blue curve, FSC 
between the shacked model refined against half map 1; red curve, FSC against 
half map 2, not included in the refinement; black curve, FSC between final 


Cryo-EM ma 
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model against the final map. The overlap observed between the blue and red 
curves guarantees anon-overfitted model.c, Left, unsharpened map coloured 
according to local resolutions, as reported by RESMAP*’. dsDNA is visible at the 
top right projecting outside of the complex. Right, final model coloured 
according to B-factors calculated by REFMAC”®. 
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Extended Data Fig. 9 | Alignment of DNA-bound TniQ- Cascade with alignments of the entire complex (top), the Cas8 and Cas5 subunits with the 
structurally similar Cascade complexes. The DNA-bound structure of V. 5’crRNA handle and double-stranded PAM DNA (middle top), the Cas7 subunit 
cholerae 1-F variant TniQ-Cascade complex (left) was superposed with DNA- witha fragment of crRNA (middle bottom), and the Cas6 subunit with the 
bound structures of P. aeruginosa |-F Cascade" (also knownas Csy complex; 3’ crRNA handle (bottom). 
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Respiratory syncytial virus (RSV) and human metapneumovirus (HMPV) cause severe 
respiratory diseases in infants and elderly adults’. No vaccine or effective antiviral 
therapy currently exists to control RSV or HMPV infections. During viral genome 
replication and transcription, the tetrameric phosphoprotein P serves as a crucial 
adaptor between the ribonucleoprotein template and the L protein, which has RNA- 
dependent RNA polymerase (RdRp), GDP polyribonucleotidyltransferase and cap- 
specific methyltransferase activities”*. How P interacts with L and mediates the 
association with the free form of N and with the ribonucleoprotein is not clear for 
HMPV or other major human pathogens, including the viruses that cause measles, 
Ebola and rabies. Here we report acryo-electron microscopy reconstruction that 
shows the ring-shaped structure of the polymerase and capping domains of HMPV-L 
bound toa tetramer of P. The connector and methyltransferase domains of L are 
mobile with respect to the core. The putative priming loop that is important for the 
initiation of RNA synthesis is fully retracted, which leaves space in the active-site 
cavity for RNA elongation. P interacts extensively with the N-terminal region of L, 
burying more than 4,016 A’ of the molecular surface area in the interface. Two of the 
four helices that form the coiled-coil tetramerization domain of P, and long 
C-terminal extensions projecting from these two helices, wrap around the L protein in 
amanner similar to tentacles. The structural versatility of the four P protomers— 
which are largely disordered in their free state—demonstrates an example of a 
‘folding-upon-partner-binding’ mechanism for carrying out P adaptor functions. The 
structure shows that P has the potential to modulate multiple functions of L and these 
results should accelerate the design of specific antiviral drugs. 


The 13-kilobase nonsegmented negative-strand RNA genome of HMPV 
encodes nine proteins!. Following virus entry, nucleoprotein-RNA is 
released into the cytoplasm of the host cell and serves as a template 
for the viral polymerase to produce sub-genomic capped and poly- 
adenylated viral mRNAs, and for genome replication through an anti- 
genome intermediate. A cap 1structureis formed at the 5’ end of viral 
mRNAs by the capping and methyltransferase (MTase) domains of L??. 
Structures are available for segmented negative-strand virus polymer- 
ases suchas influenza‘, which acquire a capped primer by ‘cap snatch- 
ing’. For polymerases of the Mononegavirales order, which includes 
several major human pathogens, the structure of the rhabdovirus 
vesicular stomatitis virus (VSV) L protein was determined first? and a 
crystal structure of the MTase domain of HMPV-L was reported>. 
Co-expressing the Land P proteins (Fig. 1a, b and Extended Data Fig. 1a) 
of HMPV in Sf9 insect cells yielded a stoichiometric preparation that 
was suitable for functional and structural studies (Fig. 1c-e). A primer 


elongation assay demonstrated that wild-type (WT) L,:P complexes had 
polymerase activity (Fig. 1c). Ly;:P complexes were also tested for denovo 
RNA synthesis activity using atemplate that consisted ofa sequence from 
the3’end of the HMPV genome (le14). The major band that was detected in 
reactions, which contained Ly;:P, all four NTPs and radiolabelled UTP asa 
tracer, was 12 nucleotides (nt) inlength, consistent with position 3C being 
the dominant initiation site (Fig. 1d). This product was not detectable in 
areaction mix that contained only radiolabelled UTP and no other NTPs, 
orinareaction that contained a chainterminator, confirming that it was 
the product of de novo initiation at the promoter. L(D745A):P, in which 
Asp745 from the conserved ‘GDNQ’ motifC was mutated to Ala, was devoid 
of polymerase activity, ruling out the possibility that a contaminating 
polymerase was responsible for the observed activity. Thus, recombinant 
L:Pis active for RNA polymerization (Fig. 1c, dand Extended Data Fig. 2). 

We determined the structure of the L:P complex to a resolution of 
3.7 A using cryo-electron microscopy (cryo-EM) (Fig. le, Extended 
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Fig. 1| Overall structure of the HMPV-L:P complex. a, b, Domain organization 
of L (a) and P (b) outlining the conserved regions and motifs. Regions of 
HMPV-P that are predicted to interact with N°, M2-1, Land RNP, and 
phosphorylation sites are indicated”. c, Primer elongation assay. Sequences of 
the18-nt RNA template and for the 5-nt primer are shown with nascent RNA in 
blue and radiolabel incorporation sites in red. This experiment was performed 
atotal of four times with two different buffer conditions. d, RdRp activity assay 
using the le14 RNA template. Sequences for the 5’-triphosphorylated 25-nt and 
3-nt markers and the 12-nt product are indicated. The 3’dGTP chain terminator 


Data Fig. land Supplementary Video 1). The final three-dimensional 
reconstruction (Extended Data Fig. 3) enabled us to build an atomic 
model for residues 8-1380 of Land for the bound P tetramer (Fig. 1f, g 
and Extended Data Figs. 3-5). Residues 1381 to the C terminus are not 
visible, suggesting flexibility of the peripheral appendage® compris- 
ing CR-VI+. A comparison of HMPV-L with VSV-L? returns a root mean 
square deviation (r.m.s.d.) of 2.8 A, illustrating the preservation of 
the core L structure during the evolution of Mononegavirales (Sup- 
plementary Fig. 2). This similarity extends to the N-terminal domain 
(NTD) of VSV-L, which is essential for viral transcription® (Extended 
Data Fig. 6). The RdRp folds into the canonical fingers-palm-thumb 
subdomains observed in many RNA polymerase structures: (Fig. 2a). 
Nucleoside inhibitors have been developed that target the RdRp activ- 
ity of RSV and HMPV’. The 5’ triphosphate metabolites of ALS-8176/ 
ALS-8112 inhibit HMPV and RSV in nonhuman primates with similar 
potency and resistant RSV mutants have been sequenced’. We mapped 
the locations of RSV-L resistance mutations’ to helix a29 of the present 
HMPV-L structure (Extended Data Fig. 7). 
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is labelled G. Radiolabelled UMP is shown in red. Data are representative of 
three independent experiments. For gel source data, see Supplementary 

Fig. 1.e, Overview of the HMPV-L:P cryo-EM three-dimensional reconstruction. 
f, Overview of the L:P atomic model. RdRp, cyan; capping domain, green; 
phosphoprotein tetramer subunits: P1, magenta; P2, hot pink; P3, salmon; P4, 
pink. Atoms of the GDNQ motif in the RdRp and the ‘HR’ motif in the capping 
domains are shownas coloured spheres. g, Rotated view of the L:P atomic 
model. 


Asin VSV-L, the capping domain of HMPV-L (Fig. 2b), containing the 
HR catalytic motif”, has a large interface with the RdRp, accounting for 
the rigidity of the ‘doughnut’. Arg1264 is exposed to the solvent, owing 
tothe mobility of the C-terminal appendage. A feature shared by RdRps, 
which do not needa polynucleotide primer to initiate polymerization, 
is a priming loop (Fig. 2c, d) that supports the initiating rNTP®. In the 
influenza polymerase*”, a proline at the tip of a B-hairpin of PB1 has a key 
role in replication initiation. In rhabdoviruses (VSV and rabies virus), 
the corresponding loop is in the capping domain and contains residues 
that are important both for terminal de novo initiation and for mRNA 
capping”, witha Trp residue in the capping domain that is essential for 
de novo initiation’°—indicating it is the priming residue. In sequence 
alignments, a Tyr residue lies at a similar position in HMPV-L (Tyr1201) 
and RSV-L (Tyr1276), which could provide base-stacking interactions 
that are required for priming (Supplementary Fig. 2). We tested this 
possibility using the RSV polymerase, which initiates at positions 1 
and 3 onits promoter. A Tyr1276Ala substitution did not hamper RSV 
initiation from either site, but the substitution of either Pro1261 or 
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Fig. 2| Structure of the HMPV-L protein. a, The RdRp viewed inits front 
orientation. Fingers subdomain, blue; palm, red; thumb, green; NTD, grey. 
RdRp motifs A-G are shown. b, The capping domain of L (green ribbons) of 
HMPV-L. A superposition of HMPV-L with VSV-L? (grey ribbons) highlights the 
difference between conformations of the putative priming loops. The putative 
priming loop of HMPV-L (including Thr1192 of the GXXT motif) is shownin red 
and of VSV-L (1157-1173) is shown in gold. P, magenta. c, RNA capping motifs? 
A’-E’.d, Overall view of HMPV-L with VSV-L superimposed, highlighting the 
mobile appendage (displayed in the VSV-L orientation, but disordered inthe 


Trp1262 with Ala—corresponding to Pro1186 and Trp1187 in HMPV-L 
in motif B’ of the capping domain (Fig. 2c)—inhibited an early stage 
of RNA synthesis within the promoter region (Fig. 2e, f), suggesting 
that either this Pro or Trp residue might be responsible for priming 
de novo RNA synthesis. However, we cannot exclude the possibility 
that the N-terminal region of the loop has a structural function that is 
not directly related to de novo initiation in HMPV. 

In VSV-L, the priming loop largely occludes the central RNA- 
binding cavity (Fig. 2d), precluding the formation of along double- 
stranded RNA duplex and indicating a pre-initiation conformation’. The 
putative priming loop in our structure of HMPV-L retracts completely 
into a cavity in the capping domain, leaving ample space for several 
base pairs of a primer-template double-stranded RNA intermediate 
(Fig. 2b, d). Modelling an elongation complex in the HMPV-L tunnel 
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present structure) and rNTP (S) entry, template (T) and nascent (U) RNA 
tunnels. e, Effect of substitutions in aromatic and proline residues inthe 
putative priming loop of RSV-Lon RNA production from the +1 (circles) and +3 
(triangles) sites. Data are mean +s.e.m. of three independent experiments. 

f, Sequence conservation in the putative priming loop of L. The four residues 
that were mutated (e) are indicated by arrows. A proline residue (P1186 in 
HMPV-L) that isimportant for an early stage of RNA synthesis is conserved in 
Pneumoviridae, Paramyxoviridae and Filoviridae but not in Rhabdoviridae. 


shows that an RNA duplex can indeed be accommodated inthe central 
cavity (Fig. 3a—d). 

In the pre-initiation state of VSV-L, the connector domain (CD) and 
MTase domain havea defined orientation with respect to the core, being 
locked in place by the addition of peptide 35-106 of the VSV-P protein’. In 
HMPV-L, onelinker regionis located approximately at residues 1381-1407 
just upstream of the CD and another is found at residues 1596-1598 before 
the CR-VI+ domain (Fig. 1a); these linker regions probably account for the 
flexibility of CD and CR-VI+. Mobility of the C-terminal domain (CTD), 
which has nucleoside 5’-triphosphatase, nucleoside-2’-O- and guanine- 
N’-MTaseactivities*, might be necessary to approach the 5’ end of nascent 
mRNA for cap addition. Thus, it is possible that the N-terminal residues of 
HMPV-P (whichare disordered here) area latch that keeps the appendage 
ina fixed orientation for initiation and releases it during RNA elongation. 
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Clear density corresponding to a coiled-coil of four helices that con- 
stitute the P oligomerization region” enabled the tracing of residues 
169-266 and 171-236 for the two proximal P subunits P1 and P2inclose 
contact with L, and of residues 168-219 and 169-231 for the two distal 
P subunits P3 and P4 (Figs. 1f, g, 4a). Much of P appears to fold upon 
binding to L, as shown by the presence of several extended arms that 
are typical of large molecular assemblies such as ribosomes or viral 
particles (Figs. 1,4b and Supplementary Video 2). A large molecular 
surface area (Extended Data Fig. 4) becomes buried in the interaction, 
which mostly involves residues from P1 and P2, whereas P3 and P4 pro- 
vide few interactions (Fig. 4a). Previous structural studies mapped the 
tetramerization and L-binding regions of P to the four parallel a-helices 
(spanning residues 171-193 of P) that could be visualized using X-ray 
crystallography”. By contrast, the N-terminal M2-1 binding regions of 
P”?, and the C-terminal regions of P that bind to the ribonucleoprotein 
(RNP) were found to be intrinsically disordered in the absence of their 
respective binding partners". Although the tetramerization region of 
P appears as a rigid unit that is largely unaltered after binding to LL, a 
disorder-to-order transition inthe C-terminal regions of the two subu- 
nits proximal to Ltakes place upon binding: residues 194-213 of P1 fold 
into a B-hairpin, such that a 3-stranded antiparallel B-sheet is formed 
with residues 383-391 of the fingers subdomain of L (Fig. 4c). This 
B-hairpinis followed by along tentacle-like structure that is composed 
of three consecutive a-helices that encircle the rear aperture, through 
which rNTPs gain access to the active-site chamber (Fig. 3a). Residues 
at the C terminus of the adjacent coiled-coil-forming helix (from P2) 
project in the opposite direction towards the capping domain (Fig. 4d) 
with residues 199-205 and 207-213 making two short a-helices, and 
215-236 forming an extended helical arm (Fig. 4a, d). Only residues 
Glu181 and Lys182 from P3 make contact with Lys422 and Leu424 from 
the fingers subdomain of Land a single contact is seen between L and 
P4 (Fig. 4d). Finally, the helix-forming residues 215-236 of P2, together 
with two long and mobile helices—presumably belonging to residues 
201-217 and 205—231 of P3 and P4, respectively—form a three-helix 
bundle that is largely exposed to solvent. Consistent with the present 
observations, previous studies have suggested that residues down- 
stream of the coiled-coil region tend to form helices and constitute a 
molecular recognition element”. 

Phosphorylation by cellular kinases regulates the transcription mod- 
ulation activity of VSV-P%. Analysis of recombinant HMPV-P using tan- 
dem mass spectrometry (Extended Data Fig. 8) showed that P residues 
targeted by phosphorylation are largely solvent-exposed and changes 
in phosphorylation should only affect contacts of P with the RNP and 
M2-1 protein, not complex formation with L (Fig. 4a). 

In the Mononegavirales, the genome forms a helical nucleocapsid, 
with RNA bound inacrevice between the two N protein subdomains". 
P acts as achaperone by preventing nonspecific aggregation of the N 
protein in the presence of nonviral RNA, preserving a pool of mono- 
meric RNA-free N protein (N°) for RNP assembly”. The P-N° interaction 
involves an a-helix from HMPV-P (residues 12-28) that inserts into a 
hydrophobic groove in the CTD of the N° protein”. Here the entire 
regions that are N-terminal to the coiled-coil of the four P subunits, 
including the N-terminal a-helix (Py;p), are disordered. The tetrameric 
coiled-coil appears to be anchored to the L protein at a location such 
that these helices, at the tips of flexible polar linkers that span residues 
29-135, can bind and deliver N° to the RNA exit tunnel for encapsidating 
nascent RNA (Fig. 4e). The amino acid sequence 29-135 of P (Extended 
Data Fig. 9) bears physicochemical properties expected of a linker 
that evolved to ensure the spatial search of nascent RNA exiting L, by 
N° bound to Pyzp. In the present structure, the C-terminal regions of P 
are in nonequivalent positions (Fig. 1f, g). These segments have pre- 
viously been proposed to dislodge RNA from the RNP complex and 
feed it into the template tunnel” (Fig. 4e). Their structural asymmetry 
probably reflects different roles for the CTDs of each subunit: whereas 
the C-terminal end of subunit P2 approaches the RNA template entry 
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Fig. 3 | Model for RNA elongation by L:P from Pneumoviridae. a, Model of the 
HMPV-L:P elongation complex. Template strand, firebrick; nascent strand, 
orange. Residues from L (Lys307, Arg313 and Arg788) and from the Cterminus 
of subunit P1 (Lys at positions 224, 227, 229, 243, 250,254 and 256, and Arg241) 
forma positively charged arch that attracts rNTPs tothe rNTP entry tunnel. 

b, Cut-out view of L showing electrostatic surfaces (blue, positive; red, 
negative) and paths followed by the template and nascent RNAstrands. 

c, Tunnels and interior cavities of L depicted as a white surface enclosed by the 
L:P complex (ribbons). Cut open are various tunnel openings that lead tothe 
exterior. d, Magnified view of the tunnel that traverses the L protein. Functional 
motifs A-G and catalytic residues “*GDNQ”™ are indicated. 


tunnel (Fig. 3c) and is well placed to bind N-RNA, the corresponding 
end of P1is too distant to have a similar function (Fig. 3a). Moreover, 
the N terminus of L (residues 8-24) wraps around the template entry 
tunnel (Fig. 3c). Thus, in addition to the C-terminal helix of P2, the 
NTD of L could help to displace N to allow naked RNA to feed into the 
template tunnel (Fig. 3c). In summary, the tetrameric phosphoprotein 
binds to the monomeric L and fulfils its various adaptor functions by 
adopting an asymmetric structure (Fig. 4e) through (i) conformational 
switching, in which each of the four P subunits folds into a different 
structure, depending on its position in the molecular assembly, and 
by (ii) spatially confining structurally important N- and C-terminal 
helical elements (Py;p and P,;p) to the free and RNA-bound N protein, 
respectively, through long flexible regions that are tethered to the 
rigidly anchored coiled-coil (Fig. 4e). 

While this manuscript was under revision, the structure of RSV L:P 
was reported’®. RSV and HMPV L and P proteins have only 48% and 36% 
overall amino acid identity, respectively, but the structures are similar 
(r.m.s.d. of 1.34 A over 1,501 residues). Moreover, the putative RSV-L 
priming loop retracts toa similar position as seen here for HMPV-L"*. In 
RSV, hydrophobic residues that are critical for L:P interactions (L216, 
L223 and L227 of RSV-P) have been shown to be critical for polymerase 
function”. The corresponding region, residues 251-265 of subunit P1 
of the HMPV phosphoprotein, contains several residues that contact 
L (Fig. 4c). The overall conservation of interactions between L and P 
within the family Pneumoviridae suggests that broad-spectrum inhibi- 
tors that disrupt the L:P interface could be effective’. We note that 
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Fig. 4| The Phomotetramer and L:P interactions. a, Interactions between 
residues from P1 (magenta), P2 (hot pink), P3 (salmon), P4 (pink) of Pand the 
RdRp (cyan). b, The electron-density map (RdRp, cyan; capping domain, green; 
and P, magenta). c, A B-hairpin of P1 forms an antiparallel B-sheet with residues 
383-391 of the fingers subdomains of L (cyan). Polar contacts between PlandL 
are indicated by dashes. d, Interactions of the coiled-coil of P and other parts of 
P2, P3 and P4 with L. e, Model for RNA replication by L:P. Genomic viral RNA, 


dimeric P proteins from rabies virus that interact with the C-terminal 
region of L’ clearly differ from the tetrameric P proteins of HMPV, which 
bind to the N-terminal part of L. Whether the mode of interactions 
between L and P observed here for HMPV and for RSV” is also found 
in more-distant Mononegavirales viral families such as the Filoviridae 
requires further structural investigation. 
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Methods 


Data reporting 

No statistical methods were used to predetermine sample size. The 
experiments were not randomized and the investigators were not 
blinded to allocation during experiments and outcome assessment. 


Cloning and expression 

HMPV L and P genes (from hMPV isolate CAN97-83, GenBank 
AY297749.1)”, codon-optimized for expression in Spodoptera fru- 
giperda 9 (Sf9) cells, were chemically synthesized and cloned into modi- 
fied pFastBacDual (Thermo Fischer Scientific) by Bio Basic. HMPV-L 
(protein sequence, GenBank AAQ67700.1) was inserted downstream of 
aStrep-tag-II followed by a TEV cleavage sequence. Strep-TEV-HMPV- 
Lwas inserted downstream of polyhedrin promoter in pFastBacDual. 
HMPV-P (protein sequence, GenBank AAQ67693.1) was inserted down- 
stream of an 8xHis-tag sequence, followed by a TEV cleavage sequence. 
Then, 8xHis-TEV-HMPV-P was inserted downstream of p10 promoter 
in pFastBacDual. We mutated pFastBacDual-HMPV-L:P by plasmid PCR 
to generate pFastBacDual-HMPV-L(D745A):P using forward primer 
5’-GACGTGTCCAAGCCTGTGAAGCTGTC-3’ and reverse primer 5’-GATG 
GACTGGTTCGCGCCGTTGAGC-3’. The mutation was confirmed by 
sequencing. Generation and isolation of the bacmid of HMPV-L:P and 
HMPV-L(D745A):P followed the instructions manual of the bac-to-bac 
expression system (Invitrogen). Viral stocks generated from purified 
bacmids were amplified and used for protein expression. 


Protein purification 

The pellet of Sf9 cells was resuspended in 1/10 culture volume in cold 
HMPV-L:P lysis buffer, 20 mM Na-HEPES, pH 7.4, 300 mM NaCl, 15 mM 
imidazole, 1mM MgCl,, 0.5 mM TCEP, and lysed by sonication. Cell 
debris was pelleted by centrifugation at 50,000g for 45 min at 4 °C. 
Supernatant was incubated with 3 ml pre-equilibrated Ni-NTA beads 
(Thermo Fisher) per litre of cell pellet for 2h at 4 °C. The beads were 
collected by centrifugation at 500g for 3 min at 4 °C. The pelleted 
beads were with 15 mM, 25 mM and 50 mM imidazole with 40, 30 
and 20 bed volumes, respectively. Washed beads were transferred 
into a gravity flow column for elution. HMPV-L:P complex was eluted 
in HMPV-L:P His-trap elution buffer containing 20 mM Na-HEPES, 
pH 7.4,300 mM NaCl, 500 mM imidazole, 1mM MgCl,, 0.5 mM TCEP, 
through gravity flow at 4 °C. Eluted fractions were analysed using 
SDS-PAGE and stained with InstantBlue (Expedeon) to visualize the 
HMPV Land P proteins (Extended Data Fig. 1a). Fractions containing 
the HMPV-L:P complex were pooled with the addition of 1/10 (mg mI) 
of TEV and subjected to dialysis against HMPV-L:P lysis buffer with 
no imidazole at 4 °C overnight. Cleaved HMPV-L:P proteins were 
incubated with 1 ml Ni-NTA beads at 4 °C for 1h. Flow-through and 
wash fractions containing L:P were pooled and subjected to heparin 
chromatography: a 5-ml HiTrap Heparin (GE Healthcare) column 
was equilibrated with 20 mM Na-HEPES, pH 7.4,300 mM NaCl, 1mM 
MgCl, 0.5 mM TCEP, and the protein sample was loaded, washed 
with the same buffer and eluted using a buffer containing 20 mM 
Na-HEPES, pH 7.4,1mM MgCl,, 1M NaCl, 0.5 mM TCEP. Fractions that 
contained pure L:P were pooled, concentrated and loaded ontoa 
Superose 6 increase 10/300 column (GE Healthcare) pre-equilibrated 
with 20 mM Na-HEPES, pH 7.4, 300 mM NaCl, 1mM MgCl,, 0.5 mM 
TCEP. Fractions collected were subjected to SDS-PAGE and instant 
blue staining. Fractions that contained the L:P complex were pooled 
and concentrated using Vivaspin15R with a 100-kDa cut-off concen- 
trator (Sartorius) toa concentration of 0.85 mg mI‘ for cryo-EM. The 
final yield for the L:P complex used for cryo-EM was 0.045 mg I}. 
During the heparin chromatographic step, a large excess of P was 
observed to flow through the heparin column. This fraction was 
further purified by size-exclusion chromatography using a superdex 
200 10/300 column (GE Healthcare) pre-equilibrated with 20 mM 


Na-HEPES, pH 7.4, 300 mM NaCl, 1 mM MgCl, 0.5 mM TCEP and the 
pure P was finally concentrated ina Vivaspin15R concentrator (10 kDa 
cut-off). The yield of free P was 0.23 mg I". To obtain protein for the 
polymerization assay, both the wild-type L:P and L(D745A):P mutant 
were purified following the same procedure, but without the hepa- 
rin chromatography step. The L protein in the L:P preparations was 
quantified by separation on SDS-PAGE and detection using colloidal 
blue staining. The molar concentration of L:P was determined with 
respecttoL. 


RdRp activity assays 

PAGE-purified RNA oligoribonucleotides representing residues 1-14 
(Fig. 1d) or 1-25 (Extended Data Fig. 2b) of the le sequence were pur- 
chased from Sigma and used as a template for in vitro RNA synthe- 
sis. Purified recombinant HMPV-L:P at a concentration of 13 nM was 
incubated with 1 1M RNA oligonucleotide in a total reaction volume 
of 20 pl in vitro transcription buffer (SO mM Tris-HCl, pH 7.5, 40 mM 
KCI, 5.0 mM MgCL,, 0.2 U pl murine RNase inhibitor, 0.01% (v/v) Triton 
X-100 and 1mM DTT). The reactions were pre-equilibrated on ice for 
30 min. Transcription reactions were initiated by the addition of rNTPs 
(1mMeach of rATP, rGTP and rCTP, and 20 pMrUTP) and 20 nM [a-**P] 
rUTP (3,000 Ci mmol“, Perkin Elmer) or as indicated in the legends 
and incubated at 30 °C for 3 h. Reactions were terminated by addi- 
tion of an equal volume of deionized formamide. After addition of 
proteinase K, the reaction mixtures were further incubated at 37 °C 
for 20 min. The reaction products were heat-denatured for 10 min at 
95 °C and a quarter of the reaction volume was resolved by denaturing 
urea PAGE (20% acrylamide, 0.5x TBE). Gels were subsequently dried 
ontoa positively charged Hybond N+ nylon membrane (GE Healthcare) 
and imaged using a phosphor imager. Data are representative of three 
independent experiments. 

The length of the RNA products of the RdRp activity assay was deter- 
mined by comparison with 3-nt and 25-nt long 5’-triphospho-oligo- 
RNAs synthesized in vitro using T7 RNA polymerase and [a-P]rATP 
incorporation. In brief, annealed complementary DNA oligonucleo- 
tides, representing aclass II T7 phage promoter followed by the antici- 
pated 25-nt trailer RNA product sequence, were incubated with T7 RNA 
polymerase (New England Biolabs) and [a-”P]rATP according to the 
manufacturer’s instructions. To generate the 3-nt long 5’-triphospho- 
oligo-RNA marker (5’-pppACG), the reaction was performed under the 
same experimental conditions except that rGTP was substituted with 
a3’-dGTP chain terminator. 

To characterize the 3’ extension activity of HMPV polymerase on 
the RNA template, the le25 template was labelled with Escherichia 
coli poly(A) polymerase (New England Biolabs) in the presence of 
[a-??P]rATP and excess of the chain-terminating analogue cordycepin 
3’-triphosphate (3’-dATP). A quantity of 1 uM RNA was incubated (37 °C, 
60 min) with 0.25 U pI £. coli poly(A) polymerase in reaction buffer 
(50 mM Tris-HCl (pH 7.9), 250 mM NaCl, 10 mM MgCl.) supplemented 
with 20 uM 3-dATP and 100 nM [a-”P]rATP (3,000 Ci mmol”, Perkin 
Elmer). 

Primer elongation reactions contained 2 uM template (containing 
a3’ phosphate group), 20 uM primer (containing a 5’-hydroxyl group) 
and 20 pMeach of rATP, rCTP, rGTP and rUTP, with 60 nM [a-”P]rGTP 
tracer ina buffer containing 50 mM Tris-HCl, pH 7.4,8 mM MgCl,,5mM 
DTT and 10% glycerol. Primer elongation was initiated by the addition 
of HMPV-L:P to aconcentration of 20 nM ina total reaction volume of 
25 pl. Reactions were allowed to proceed for 2 hat 30 °C. After heat 
inactivation for 3 min at 90 °C, reactions were heat denatured for 5 min 
at 95 °C in an equal volume of 10 mM EDTA in deionized formamide 
with 0.02% each bromophenol blue and xylene cyanol dyes. Next, 7 pl 
of the 50-p1l stopped reaction mix was subjected to denaturing urea 
PAGE (20% acrylamide, 7 M urea, 1x TBE). The gel was dried onto 3MM 
chromatography paper and the radiolabelled products were detected 
by phosphor-imaging analysis. 


Cryo-EM 
We glow-discharged Quantifoil R 1.2/1.3 holey carbon 200-mesh 
copper grids (Quantifoil Micro Tools) at 20 mA for 45s. To vitrify the 
sample, we applied 3.5 pil of the HMPV-L:P complex (both diluted to 
0.8 mg mI’) to each grid, and plunge-froze the sample in liquid ethane 
using a Vitrobot Mark IV (FEI) after a4.0-s blotting (offset —2) with filter 
papers pre-saturated for 15 min under 100% humidity. We screened the 
sample homogeneity and cryo preparation in an Oxford-style CT3500 
cryo-specimen holder (Gatan) onaliquid-nitrogen-cooling Tecnai F20 
electron microscope (FEI), operated at 200 kV and equipped witha K2 
summit direct detector (Gatan). Wescreened for thin and cleanice with 
amaximum distribution of nonoverlapping protein particles. Sample 
homogeneity was judged by visual observation and by two-dimensional 
classification (Extended Data Fig. 1). We optimized and prepared two 
sets of cryo-grids, one with the purified L:P sample and the other with 
purified L:P supplemented with the free P sample at a1:0.4 molar ratio. 
We automated data acquisition using SerialEM”° on a K2 summit 
detector (Gatan) equipped with a Tecnai Polara electron microscope 
(FEI) operated at 300 kV and at a nominal magnification of 31,000x. 
The calibrated physical pixel size was 1.24 A, corresponding to a cali- 
brated magnification of 40,323 on the camera sensor plane. We con- 
firmed beam parallelization by collecting a few videos on carbon after 
comma-free and objective astigmatism alignments and before the 
actual data collection; the Thon rings of carbon images consistently 
extended beyond 2.9-2.7 A and were of excellent stigmation (Extended 
Data Fig. 1). For all datasets, we recorded 40 frames per video in super- 
resolution mode (0.62 A per pixel) using a dose rate of 9.2 electrons 
per physical pixel per second (1.24 A per pixel), and a total exposure 
of 48 electrons per A”. We recorded a total of 7,438 and 1,100 dose- 
fractionated images for the L:P sample and L:P supplemented with P 
sample, respectively. 


Image processing 

We aligned the super-resolution video frames using MotionCor2” with 
dose filtering and 5-by-5 patched sub-frame alignment; the frames were 
binned over 2-by-2 super-resolution ‘pixels’ in Fourier space, yielding 
the physical pixel size of 1.24 A of the saved micrographs. We binned the 
motion-corrected micrographs by a factor of 4 (4.96 A per pixel) and 
low-pass-filtered to 15 A for visual examination. We discarded micro- 
graphs showing thick ice, abnormal defocus or large carbon areas. For 
particle-picking and two-dimensional classification, we down-sampled 
the micrographs to 2.48 A per pixel in Fourier space using resample_ 
mp.exe” to facilitate computation. We manually picked particles for 10 
representative micrographs using e2boxer.py” and used these boxes 
to train the auto-picking model in crYOLO. We then used the trained 
model to automatically pick particles from all micrographs with the 
crYOLO Phosaurus network. We visually reviewed the results and manu- 
ally improved the automatically picked boxes for the 10 micrographs, 
while incorporating missed representative micrographs for the next 
round training. We repeated the crYOLO training and autopicking pro- 
cedures until all particles were accurately picked and well-centred for 
all micrographs. The final L:P dataset contained a total of 5,381,943 
particles, and the L:P supplemented with P dataset contained 606,410 
particles. 

We determined the defocus values and defocus angles of the 
unbinned (1.24 A per pixel) micrographs using ctffind4”°. The Thon 
ring fitting resolution (reported by ctffind4) of most selected micro- 
graphs consistently fell into the 2.9-3.3 A range (Extended Data Fig. 1), 
indicating good data quality. We performed two-dimensional classifi- 
cation in Relion” until all high-quality two-dimensional class images 
became apparent (within 50 iterations). We only selected the best two- 
dimensional classes judged by visual examination and by the resolution 
and other statistics reported by Relion, retaining 880,724 particles for 
three-dimensional analysis. 


To speed up computation, we started the initial three-dimensional 
analysis with the data down-sampled to 2.48 A per pixel. We generated 
the ab initio model (3 classes) using Relion, and selected the best class 
as areference model for further data analysis. We performed one round 
of mode-3 alignment for the two-dimensional class images against the 
three-dimensional initial model in frealign (version 9.11)”, with CTF 
correction turned off. We then calculated projection matching using 
frealign (version 9.11)” to cross-validate the three-dimensional model 
and to confirm the two-dimensional class selection as a self-consistency 
test. We performed three-dimensional classification with 6 classes 
in Relion until convergence (within 50 iterations). Two classes were 
clearly of high quality and superimposed well, with a cross-correlation 
coefficient higher than 99%. We combined these two classes, yielding a 
total of 437,611 particles. The handedness of the maps was confirmed 
to be correct as judged by well-defined a-helices. From this point on, 
we scaled the alignment parameters to 1.24 A per pixel and used the 
full resolution data for further analysis. Relion auto-refinement of 
these particles produced a map with a resolution of 4.0 A. We further 
improved the resolution to 3.9 A after refining the per-particle motion 
inRelionand to 3.8 Aafter beam tilt refinement by treating all particles 
as one group. Refining the defocus of individual particles did not yield 
obvious improvement. We confirmed convergence of motion cor- 
rection by another round of per-particle motion refinement, which 
yielded no apparent improvement. Nonetheless, the quality of this map 
at 3.8 A resolution was excellent, as judged by main-chain continuity 
and side-chain densities (Extended Data Fig. 1g, h). We further classi- 
fied the 437,611 particles into 6 classes and discarded 4 junk classes, 
yielding a total of 302,346 particles. We auto-refined the two good 
classes individually or combined and again they overlapped nearly 
perfectly. We then gave the data more weights by adjusting the T num- 
ber in Relion and subsequently performed masked alignment, while 
correcting the Fourier shell correlation (FSC) for effects of the solvent 
mask. We visually examined the results and found out that a T number 
of 4 produced the best map without apparent signs of over-fitting. 
This, combined with another round of per-particle motion refinement, 
beam tilt refinement and per particle defocus refinement, eventually 
yielded a map at 3.7 A. Inan effort to improve the density associated 
with P, we generated masks for P and L using Chimera segger” and 
performed multi-body refinement using Relion, with the orientation 
of P relative to L fixed. We also subtracted the L signal from the two- 
dimensional images, excised P from the two-dimensional images, and 
performed three-dimensional classification without alignment. Neither 
approachimproved the P density, perhaps owing to insufficient signal 
considering the small size of P. The final map was at 3.7 A resolution 
and was sharpened with a B-factor value of -188.25 A?. The procedure 
is summarized in Extended Data Fig. 3. 

We followed the same data-processing paradigm with minor adjust- 
ments for the dataset of the L:P preparation supplemented with P. 
The classification results were consistent with the L:P dataset, leaving 
273,574 and 65,398 particles for the good classes after two-dimensional 
classification and a first round of three-dimensional classification, 
respectively. The final map after a second round of three-dimensional 
classification hada resolution of 4.1A comprising 29,376 particles and 
was nearly identical to the map from the L:P dataset. Hence, we mainly 
used the map from the L:P dataset for map interpretation, except for 
two less-well-defined C-terminal helices of two P subunits (P3 and P4). 


Model building and refinement 

We built the atomic model using the graphics programs O*8 and Coot”. 
First, we fitted the RdRp and capping enzymatic domains of the VSV-L 
model (RCSB Protein Data Bank code (PDB) 5A22) into the map as well 
as the coiled-coil domain of P (PDB 4BXT) and then used these as a guide 
to find main-chain connectivity. Local resolution analysis indicated that 
the majority of the core portion of the complex falls within a resolu- 
tion range of 3.0-3.6 A, with excellent main-chain connectivity and 


Article 


side-chain densities throughout the molecules (Extended Data Fig. 1g, 
h). The majority of the electron density showed excellent continuity 
giving confidence for tracing the a-carbon chain. Moreover, in most 
regions, the clear electron density of the side chains enabled us to 
unambiguously assign the sequence register. We observed only weak 
and disordered extra densities accounting for a portion of the CD and 
CRVI+ domains in maps that were sharpened with lower B-factors and/ 
or low-pass-filtered with lower spatial frequency, suggesting that these 
domains are highly flexible in the complexes that we examined. The 
appendage of HMPV-L might become fixed with respect to the ring- 
shaped core domain upon binding toa peptide encompassing residues 
35-106 of the VSV-P used in the VSV-L study. Indeed, a recent study of 
the rabies virus L protein in complex with the counterpart of this P pep- 
tide demonstrates such contacts between P and L (J. HorwitzandS. C. 
Harrison, personal communication). Although there were some 
stretches with weaker densities for the P protein, long side chains of 
arginine and lysine residues ensured the correct polarity and register 
assignment of the sequence. We further confirmed that the sequence 
was in register by comparing the model with the coiled-coil interface 
of the known structure of the phosphoprotein tetramerization region 
(PDB 4BXT). Overall, we traced the entire NTD subdomain and RdRp 
domain spanning residues 8-892, except for mobile residues 1-7 
(predicted to be disordered using the PrDOS prediction server http:// 
prdos.hgc.jp/cgi-bin/top.cgi), 607-625 (603-625 are predicted to be 
disordered), the complete capping enzyme domain (residues 893- 
1380) and the four P subunits (residues 169-266, 171-236, 168-219 and 
169-193/204-231 of subunits P1, P2, P3 and P4, respectively). Residues 
194-203 of subunit P4 are disordered. Whereas the sequences could be 
assigned unambiguously for Pl and P2 residues as well as for the coiled- 
coil region of P3 and P4, the sequence assignment for the C-terminal 
ends of P3 and P4 that are predicted to form helices is less certain. 
These segments might become fully ordered only in the presence of 
the ribonucleoprotein (main text). Model building was interspersed 
with refinement using the real-space refinement option in the package 
PHENIX” (rigid_body, minimization_global, local grid searching and 
ADP) for the coordinates and B-factors. We applied Ramachandran, 
rotamer and secondary structure restraints throughout the refinement, 
in addition to the standard geometry and B-factor restraints. More 
than 40 rounds of manual rebuilding and phenix real-space refinement 
were carried out until we had high confidence of the sequence register, 
density fitting, model geometry and local chemical environments. We 
analysed the final model with MolProbity” and statistics are given in 
Extended Data Table 1. We deposited the refined coordinates to wwPDB 
(6USO) and the electron density map to EMDB (EMD-20651). 


Modelling of the polymerase-RNA elongation complex 

We fitted a rotavirus in situ elongation complex (PDB 60J3) to the 
HMPV-L:P complex by superimposing the CA atoms of the five func- 
tional polymerase motifs in the program O” and refined the alignment 
using the O command Isq_improve. The relatively long RNA enabled 
us to easily identify the NTP entry, template entry, template exit and 
transcript exit tunnels in the HMPV-L structure. Modelling using elon- 
gation complexes of other polymerases such as influenza virus PB1 
(PDB 6QCT)*, the HCV NSS (PDB 4WTG)” and reovirus A3 elongation 
complex (PDB 1N35) confirmed the results. 


Tunnel identification 

We identified the tunnels in the structure using voidoo, mama and 
mapman of the Uppsala Software Factory**. We tested values for the 
probe radius inthe range of 1.4-2.0 A, and1.8A yielded the most accu- 
rate tunnels. We generated pseudo-atoms to define the range of the 
calculation and displayed the tunnel along with the models in PYMOL 
(https://pymol.org/2/). We performed object-specific clipping of 
the tunnel mask using PovRay (http://www.povray.org) to display its 
interior along with the modelled RNA. 


Tandem mass spectrometry 

We collected and analysed tandem mass spectrometry data (Extended 
Data Fig. 8) for both the purified L:P complex and free P following a 
standard protocol”. In brief, we digested 2 yg of each sample over- 
night at 37 °C with trypsin (1:100 enzyme:sample molar ratio). Digests 
were carried out in 200 mM EPPS pH8.5 in the presence of 2% acetoni- 
trile (v/v) with trypsin (Promega, V5111). After peptide purification by 
reverse-phase C18 chromatography, 1 pg of peptides was separated 
with a 4-h acetonitrile gradient by high-performance liquid chro- 
matography before injection. Data were collected using an Orbitrap 
Lumos mass spectrometer (Thermo Fisher Scientific) coupled to a 
Proxeon EASY-nLC 1200 liquid chromatography system (Thermo Fisher 
Scientific). The 75-um inner-diameter capillary column used was packed 
with C,, resin (Accucore, 2.6 1m). The scan sequence started with col- 
lection of an MS' spectrum (Orbitrap analysis; resolution 120,000; 
mass range 400-1400 Th). MS? analysis followed collision-induced 
dissociation (CE=35) witha maximum ion-injection time of 250 ms and 
an isolation window of 1.2 Da. Neutral loss of phosphate was taken into 
account by using a multi-stage activation method with a neutral loss of 
97.9763 Da. Peptides were searched with a SEQUEST-based algorithm 
against an insect database containing L and P protein sequences and 
also against common contaminants. Searches were performed witha 
target decoy database strategy and a false-discovery rate of 1% set for 
peptide-spectrum matches followed by linear discriminant analysis 
filtering and a final collapsed protein-level false-discovery rate of 1%. 
We dynamically searched for oxidized methionine (+15.9949146221 Da) 
and phospho-serine, -threonine and -tyrosine (+79.9663304104 Da). 
Phosphorylation site localization used a modified analysis of variance 
(ANOVA) score (ModScore). We repeated the experiments twice and 
the results were consistent. 


RSV priming loop experiments 

Mutations in codon-optimized RSV L (strain A2) were generated in 
a fragment of L open-reading frame subcloned into a pGEM T easy 
vector (Promega). The W1262A substitution was generated by site- 
directed mutagenesis by Genscript, the P1261A, P1274A, Y1276A sub- 
stitutions were generated using Q5 site-directed mutagenesis (New 
England Biolabs). RSV(P1261A) was generated using forward primer 
5’-ACCTACTAAGGCCTGGGTCGG-3’ and reverse primer 5’-CCCCTC 
TCGCCCCTGGTC-3’. RSV(P1274A) was generated using forward primer 
5’-CCAGGAAAAAAAGACCATGGCCGTCTACAACAGG-3’ and reverse 
primer 5’-GTGGAGGAGCCGACCCAGGGCTTAGTAGG-3’. RSV(Y1276A) 
was generated using forward primer 5’-CATGCCCGTCGCCAACAG 
GCAG-3’ and reverse primer 5’-GTCTTTTTT TCCTGGGTG-3’. The 
mutated open-reading frame fragments were sequenced and then 
substituted in the codon-optimized L open-reading frame contained 
inaT7 expression vector (pTMI1) by restriction digestion and ligation. 
All plasmids were sequenced to confirm the presence of mutations, 
and to ensure that no other changes were introduced during PCR- 
amplification steps. BSR-T7 cells, a BHK-derived cell line that con- 
stitutively expresses T7 polymerase* (provided by K. Conzelmann), 
in six-well dishes were transfected with pTMI1 plasmids expressing 
the following proteins: RSV (A2 strain) N (400 ng per well), P (200 ng 
per well), M2-1 (100 ng per well) and L (or L mutant; 100 ng per well), 
and a plasmid expressing a minigenome RNA containing nt 1-36 of 
the trailer promoter at its 3’ end (200 ng per well). The minigenome 
was limited to synthesizing only the first strand of replication by the 
lack of a promoter-specific sequence at the 3’ end of the replicative 
intermediate. The cells were transfected using Lipofectamine 2000 
(Invitrogen) according to the manufacturer’s instructions. After trans- 
fectionf for 40-48 h, cells were collected and RNA was purified using 
TRIzol, according to the manufacturer’s instructions, except that the 
RNA was subjected toa further round of purification by extraction with 
phenol-chloroform, followed by ethanol precipitation. RNA products 


initiated within the trailer promoter were analysed by primer exten- 
sion. RNA representing one-tenth of a well of cells was subjected to 
reverse transcription at 37 °C using the Sensiscript RT kit (Qiagen) and 
a radiolabelled primer (5’-TACGAGATAT TAGT TTT TGAGAC-3’). Half 
of the primer extension reaction was subjected to electrophoresis in 
8% polyacrylamide gels containing 7 M urea in 1x TBE. Radiolabelled 
oligonucleotides corresponding in sequence to cDNA representing 
initiation from positions +1 and +3 of the trailer promoter were used 
as markers (+1, 5’-TACGAGATAT TAGTT TT TGAGACTTTTTTTCTCGT-3’; 
+3, 5’-TACGAGATATTAGTTTTTGAGACTTTTTTTCTC-3’). Thesame RNA 
samples with RNA representing one-tenth of a well of cells were also 
subjected to denaturing gel electrophoresis on a1.5% formaldehyde- 
agarose gel and transferred to nitrocellulose by northern blotting. 
The input minigenome template was detected using a radiolabelled 
strand-specific riboprobe. Primer extension products were visual- 
ized by autoradiography and the primer extension products and input 
minigenome were quantified using phosphor-imaging analysis. The 
levels of input minigenome were used to normalize for transfection 
efficiency. The data were normalized to the mean of the two wild-type 
samples included in each experiment. With the caveat that the sample 
sizes had ann of 3, statistical analysis using ANOVA showed that the 
levels of +1 and +3 RNA products for the P1261A and W1262A L vari- 
ants were significantly different from those of wild-type L (P= 0.002 
and P=0.003, respectively, for +1; P= 0.004 for both variants for +3), 
whereas the levels of RNA produced from P1274A and Y1276A L variants 
were not statistically different (P= 0.0385 and P= 0.549, respectively, 
for +1; P=0.227 and P=0.518, respectively, for +3). 


Figure preparation 

We prepared the figures and videos using PyMOL (Schrédinger), 
Chimera”, Python matplotlib, gnuplot, CCP4 TopDraw” and PovRay 
(http://www.povray.org). Multiple sequence alignment was performed 
with Clustal Omega*’ (https://www.ebi.ac.uk/Tools/msa/clustalo/) and 
the alignment displayed with ESPript” using previous studies*” * to 
assign the sequence motifs. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Structure coordinates are available from PDB 6USO and the electron 
density map from the Electron Microscopy Data Bank (EMDB) under 
accession code EMD-20651. All other data generated or analysed in this 
study are available from the corresponding authors upon reasonable 
request. 
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Extended Data Fig. 1| Purification of HMPV-L:P and structure determination 
using cryo-EM. a, Representative size-exclusion chromatogram of the L:P 
complex (these experiments were repeated more than five times). Fractions 
indicated by an arrowwere collected and concentrated to 0.85 mg ml and 
used for cryo-EM analysis. Inset, SDS-PAGE followed by Coomassie blue 
staining of the purified samples. Free P protein separated from the L:P complex 
by heparin chromatography is also shown. For gel source data, see 
Supplementary Fig. 1.b, Raw micrograph of HMPV-L:P particles recorded in 
vitreous ice. Scale bar, 10 nm.c, Power spectrum of the image shown inb. We 
limited the high resolution for fitting to a spatial frequency of 1/5.0 A, and 

1/2.9 Amarks the highest spacing to which CTF rings were successfully fit. 

d, Two-dimensional classes and ‘self-consistency check’ for the cryo-EM three- 
dimensional reconstruction. For each pair of boxes (top and bottom) across 
the three rows, the top image shows one two-dimensional class average and the 
bottom image shows the corresponding projection from the initial three- 
dimensional model. e, Local resolution of the cryo-EM density map. Variations 


Amplitude 


0 0.1 0.2 0. 0.4 0.5 


in local resolution are colour-coded from blue (3.0 A) to red (5.9 A), computed 
with Resmap“. f, FSC of the cryo-EM map asa function of the spatial frequency. 
The gold-standard resolutionis 3.7 Abased onthe FSC = 0.143 criterion, 
consistent with the model-to-map correlation (0.5 criterion). g, Example of the 
electron-density map that enabled model building. The region shownis at an 
interface between the RdRp and capping domain. The map is shown asa grey 
mesh, contoured at a level of 30. The atomic model is shownas sticks with 
residues from RdRp coloured in cyan (NTD in grey) and in green for the capping 
domain. h, The region shown is the three-stranded B-sheet at the interface 
between the RdRp (cyan sticks) and the phosphoprotein (magenta sticks). The 
map is shownasa grey mesh, contoured at a level of 2.50. We observed a nearly 
identical structure of the L:P complex ina reconstruction obtained by 
premixing the L:P complex with fully phosphorylated P, indicating that 
potential exchange of P affected neither the formation nor the structure of the 
L:P complex. 


a b 3 
fo 

1e25 3'-term @ _-OH 

AAA 

UU - 

NTPs GG. 

CC, - 


25 nt > 


20 nt > 


10 nt > 


=~ 
3nt>| a 
O° wm” t 
x 
=f 3 
N 
2 
Templates 1e25 3'-UGCGCUUUUUUUGCGCAUAUUUAAU-5' 
P le25[SpC3] 3'-[SpC3]-UGCGCUUUUUUUGCGCAUAUUUAAU-5' 
25-nt marker (M25) 5'-pppACGCGAAAAAAACGCGUAUAAAUUA-3' 
3-nt marker (M3) 5'-pppACG-3' 
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purification and size-exclusion chromatography. b, Analysis ofthe3’extension the 3’ terminus of the template, in addition to engaging in de novo initiation at 
activity of HMPV polymerase using the le25 RNA template. Reactions were the promoter. The radiolabelled RNA products were visualized by phosphor 
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Extended Data Fig. 3| Flow-chart depicting structure determination using cryo-EM. Further information is provided inthe Methods. 


Extended Data Fig. 4| Phosphoprotein tetramer in complex withL. a, The L 
protein (cyan) is represented as a molecular surface and the tetrameric P 
protein subunits are represented as ribbons, following the colour codes in 
Figs. 1-3. b, Structures adopted by the four individual P subunits boundtoL, 
coloured asa blue-to-red rainbow from the N- to the C-terminal end. Secondary 
structures boundaries are noted for each subunit. c,d, Superposition of the 


tetramerization helices inthe context of the L:P complex and the free P protein. 


Structures are represented as coloured ribbons with the free phosphoprotein 
coiled-coil (PDB 4BXT) coloured in grey and the four P subunits reported in this 
study coloured according to Fig. 1. Ther.m.s.d. of the superimposition is 1.13 A 
over 88 a-carbon atoms. e, View of the complex in which L and P have been 
pulled apart to display electrostatic surfaces. f, Overall view of the L:P complex 
with Pshownas ribbons and L as electrostatic surface. The P tetramer consists 
of subunits P1 (magenta), P2 (hot pink), P3 (salmon) and P4 (pink). 
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Extended Data Fig. 5| Topology of the L:P complex. Topological depiction of grey, fingers in blue, palminred, thumb in dark green and CAP in green. The 
the secondary structure elements of L and P. Helices are depicted as tubes and four subunits of the phosphoprotein P1, P2, P3 and P4 are coloured as in Fig. 1. 
strands as arrows. The colour codeisthesameasinFig.1.TheRdRpdomainand Secondary structure boundaries are indicated. 

its subdomains and the capping domain are coloured asin Figs. 1,2: NTDin 


Extended Data Fig. 6 | View of the NTD. NTD is displayed as grey ribbons 
(colours are as in Fig. 2a). The evolutionary conserved residues clustered near 
therNTP entry tunnel play a part in transcription and are represented as sticks 
and are labelled. The equivalent region of the VSV-L superimposed on 
HMPV-L is coloured in lighter grey. 
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Extended Data Fig. 7 | Model for an elongation complex stalled by the 
addition of ALS-8176 5’-triphosphate. ALS-8176 5’-triphosphate isa 
nucleoside triphosphate analogue against RSV and HMPV currently in phase 2 
clinical trials. The “*GDNQ™ catalytic motif and the positions (A789V, L7951 
and 1796V) for which mutations conferred resistance (identified by passaging 
RSV) are mapped onto the HMPV-L structure (respectively corresponding to 
A723, V729 and V730) and displayed as sticks. These conservative mutations 
probably affect inhibitor binding by inducing a slight repositioning of the helix, 
owing to altered hydrophobic contacts with neighbouring helices. The 
sequence alignment is shown in Supplementary Fig. 2. the protein is coloured 
according to Fig. 2a, and the template and nascent RNA strands according to 
Fig. 3a. 
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Extended Data Fig. 8 | MS? spectrum of the Ser'** P phosphopeptide. a, One respectively. b, They and bion series m/zidentified inthe spectrum shownina 
MS? spectrum used for identification of the phosphorylated P peptide and their deviation from theoretical m/z are shown in the table. The present 


™DALDLLS#DNEEEDAESSILTFEER is displayed. Tandem mass spectrum (top) pattern of phosphorylation agrees with observations showing that 

and deviation (bottom) enabled detection of phosphorylation (indicated bya phosphorylation of the peptide comprising residues 100-120 of RSV-P*—in 
hash symbol) at site Ser“*. Peptides fragmented from the N terminus particular, phosphorylation of Thr108*°, which corresponds to Ser148 of 
(b-fragments) and C terminus (y-fragments) are coloured in blue and red, HMPV-P (Extended Data Fig. 9)—controls its interaction with the M2-1 protein. 
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Extended Data Fig. 9 | See next page for caption. 


Extended Data Fig. 9 | Structure-based sequence alignment of the 
phosphoprotein from HMPV (labelled HMPV-A, strain CAN97-83) and other 
known pneumoviral P proteins. HMPV-B, human metapneumovirus subgroup 
B; HRSV-A and -B, human respiratory syncytial virus subgroup A and B, 
respectively; BRSV, bovine respiratory syncytial virus; PVM, pneumonia virus 
of mice; AMPV-A and AMPV-C, avian metapneumovirus subgroup A and C, 
respectively. The following sequence accession codes from GenBank were 
used for the alignment: HMPV-A, AAQ67693.1 (used in this study); HMPV-B, 
AAQ67684.1; HRSV-A, AAX23990.1; HRSV-B, AAR14262.1; BRSV, AAL49395.1; 
PVM, AAW79177.1; AMPV-A, AAT68644.1; and AMPV-C, AAT86110.1. The 
secondary structure of HMPV-P subunit P1 (this study) is displayed above the 
alignment. Phosphorylation sites are highlighted in brown. Positively charged 
residues of HMPV_P are shaded in blue, negatively charged residues in magenta 
and hydrophobic residues 29-135 in yellow. The conserved region containing 


hydrophobic residues critical for L:P interactions is highlighted in green. 
Structural alignment of P from HMPV and RSV” showed similar overall 
tetramer organization. However, differences are observed in subunit P1 with an 
r.m.s.d. of 2.24 A over 82 residues. Although P is generally more mobile with 
weaker densities and higher B-factors compared to L, the region following the 
B-hairpin (residues 175-215 in HMPV) does adoptaslightly different 
conformation compared to RSV P1. Subunit P3 has anr.m.s.d. of 1.94 A over 45 
residues, owing toaslightly tilted C-terminal helix compared to RSV. Subunit 
P2is most similar with an r.m.s.d. of 0.92 A over 56 residues. Subunit P4 has an 
r.m.s.d. of 1.33 A over 47 residues. The eight residues of HRSV-P, which are 
crucial for the interaction with HRSV-L and substitutions of which impair viral 
replication, are shaded in dark green (data froma previous study"). With the 
exception of Asn189 (HRSV-P), for which a deletion is present in HMPV-P, these 
residues are conserved in HMPV-P and other known pneumoviral P proteins. 
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Extended Data Table 1| Cryo-EM data collection, structure refinement and model statistics 


HMPV L:P complex 
(EMDB-20651) 


(PDB 6U50) 


Data collection and processing 
Magnification (nominal/calibrated) 
Voltage (kV) 
Electron exposure (e—/A’) 
Defocus range (jm) 
Pixel size (A) 
Symmetry imposed 
Initial particle images (no.) 
Final particle images (no.) 
Map resolution (A) 

FSC threshold 
Map resolution range (A) 


Refinement* 
Initial model used (PDB code) 
Model resolution (A) 
FSC threshold 
Model resolution range (A) 
Map sharpening B factor (A’) 
Model composition 
Non-hydrogen atoms 
Protein residues 
Ligands 
B factors (A*) 
Protein 
Ligand 
R.m.s. deviations 
Bond lengths (A) 
Bond angles (°) 
Validation” 
MolProbity score® 
Clashscore 
Poor rotamers (%) 
Ramachandran plot 
Favored (%) 
Allowed (%) 
Disallowed (%) 


31,000/40,322.58 
300 


15.62 (5.27-52.43) 


0.004 
0.717 


1.31 (98" percentile) 
2.13 
0.00 


100.00 
0.00 
0.00 


®Refinement statistics were obtained with program Phenix (phenix.real_space_refinement)°*° unless otherwise noted. 


’Molprobity score was obtained from the Molprobity online server (100th percentile is the best among structures within the resolution specified) and the current structure is in the 98th percen- 


tile (n = 27,675, 0-99 A). 


‘The assignment of Pro1041 as a cis-proline is supported by electron density showing a hydrogen bond between the carbonyl oxygen atom of Thr1040 and the N atom of Gln1353. 
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Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
L— Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection SerialEM (v3.7.1-64bit) 


Data analysis Please see the methods section for details. MotionCor2 (v1.2.1), crYOLO (v1.2.3), ctffind (v4.1.9), EMAN2 (v2.2), Relion (v3.0.4), Frealign 
(v9.11), ResMap (v1.1.4), O (v14.0.0), Coot (v0.8.8), Phenix (v1.16-3549), CCP4 (v7.0), Buster (20190301), MolProbity (v3), voidoo 
(v3.3.2), mama (v6.1.9), mapman (v7.8.5), PYMOL (v2.3.2), Chimera (v1.13.1), Pov-Ray (v3.8), matplotlib (v3.1.1), gnuplot (v5.2). 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 


All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 
- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


cryo-EM maps and coordinates have been deposited and released (EMDB access code: EMD-20651 and PDB code: 6U50). Scans of raw gels deposited to 
Supplemental figures associated with this manuscript. All other data generated or analyzed during this study are available from the corresponding authors upon 
reasonable request. 


=) 
jad) 
ay 
S 
o 
= 
o 
Za) 
© 
jad) 
= 
(@) 
= 
= 
o 
70) 
fe) 
er 
=) 
a 
Za) 
= 
S| 
3 
jad) 
5 
< 


Field-specific reporting 


Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


X Life sciences Behavioural & social sciences Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size Statistical methods were not needed to pre-determine sample size for the biochemical and structural studies in this work. We collected 
multiple independent cryoEM datasets for structural analysis; all other experiments were performed multiple times with similar results. 
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Data exclusions — All data were analyzed. 

Replication All biochemical and mass spectrometry experiments were performed multiple times and resulted nearly identical results (Please see methods 
section for details). Multiple cryoEM datasets were collected and analyzed, yielding very similar results. Evidence of replicaton also included 
similar results obtained by different data processing strategies, by using different softwares, by results shown in the data processing flowchart 
(Extended Data Figure 3), and by processing different subsets (Please see the methods section for details). 


Randomization — Experimental groups were not needed for this work, and hence randomization was irrelavant. 


Blinding The investigators were blinded to group allocation during data acquisition and analysis because groups were not necessary for this work. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 

n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 
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In this Letter, the y-axis label of Fig. 4d should read ‘Event-free survival 
(%)’ rather than ‘Overall survival (%)’, as correctly described in the figure 
legend, main text and Methods. In addition, in the ‘Clinical and treat- 
ment data’ section of the Methods, the fourth sentence should read: 
“Treatment data were curated for the patients receiving one or more 
lines of either PARP inhibitor or immune checkpoint blockade therapy 
via free-text search of medical and pharmacy records followed by 
expert review ina subset of cases, as follows.’ instead of: “Treatment 
data were curated for the patients receiving one or more lines of either 
PARP inhibitor or immune checkpoint blockade therapy via free-text 
search and expert review of medical records.”. Neither of these changes 
affects the data, results or conclusions of the Letter, and the original 
Letter has been corrected online. 


Nature | Vol577 | 9 January 2020 | E1 


Corrections & amendments 


Author Correction: 
Superluminal motion ofa 
relativistic jetinthe 
neutron-star merger 
GWI170817 


https://doi.org/10.1038/s41586-019-1860-5 


Correction to: Nature https://doi.org/10.1038/s41586-018-0486-3 


Published online 05 September 2018. 


K. P. Mooley, A. T. Deller, O. Gottlieb, E. Nakar, G. Hallinan, S. Bourke, 
D. A. Frail, A. Horesh, A. Corsi & K. Hotokezaka 
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A BIOLOGIST TALKS 
TOAPHYSICIST 


Working alongside people from different fields can help in improving 
communication skills and identifying gaps in knowledge. By Kenneth S. Kosik 


sa physician-scientist, many of my 
colleagues were surprised when I 
moved my laboratory fromthe Boston 
Longwood campus at Brigham and 
Women’s Hospital in Massachusetts 
to the University of California, Santa Barbara 
(UCSB), where there is neither a medical school 
nor auniversity-affiliated hospital. 
More than a few e-mails arrived — some 
expressed puzzlement, some surprise, some 
had a wink, but all were punctuated with an 


exclamation mark. 

When I made the move ten years ago, I had 
wanted to shift my biomedical work closer to 
the interests of physical-science researchers, 
to discover broadly applicable principles 
within the framework of biology, and to grasp 
the multilayered complexity hiddenin nearly 
every question posed by biologists. 

The physical sciences are not lacking in 
Boston. In fact, they are world class. But my 
lab was on the ‘medical’ side of the Charles 
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River, practically an ocean away from the 
physical-science labs at Harvard University in 
Cambridge, Massachusetts, on the other side. 
So, engaging scientists in physics and chemis- 
try, or in computer science and engineering, 
was challenging — especially for a medical 
doctor trained rather narrowly in molecular 
and cellular biology. 

My goal was simply to opena conversation 
and possibly a collaboration with physicists, 
not to become one. As a relatively small 
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institution with a distinguished faculty in 
both the physical sciences and engineering, 
UCSB was, | felt, an ideal place to wade into 
this territory. Over the ensuing decade, this 
risky move has resulted in my sharing many 
graduate students and postdoctoral fellows 
with computer-science, engineering, physics 
and chemistry faculty members. These collab- 
orations have greatly broadened the science 
in all my publications. 

I devised a few simple rules to help the 
biologist in me to cross the divide between 
the life and physical sciences. In learning to 
talk to physicists, I discovered that | can com- 
municate better with everyone and clarify for 
myself what I do and do not understand in my 
own field. 


Understand what ‘I do not understand’ 
means. When physicists say they do not 
understand something that you have said 
about biology, it’s possible that you do not 
understand that topic either. 

‘Understanding’ operates at different planes 
in different disciplines, and when a physicist 
seeks understanding, what they hope to 
grasp might differ from the knowledge that 
a biologist seeks. 

For example, a biologist understands gene 
transcription by identifying specific transcrip- 
tion factors, their binding sites, the roleof RNA 
polymerase and the genes that get activated. 

For the physicist, these crucial facets of 
transcription — specific gene names and 
binding sites — are extraneous details. Instead, 
among the questions they consider important 
are the probability distributions associated 
with attracting transcription complexes to 
specific sites and the quantification of the 
forces involved in this process. 

Clearly, questions of this nature are of keen 
interest to biologists once we sidestep our love 
affair with our favourite gene. 


Seek common ground. Whena physicist says 
they do not understand an aspect of biology, 
they are not requesting a ‘biology 101’ expla- 
nation. In my experience, when physicists ask 
a biology question, they want to apply the 
thinking of physics to biology; specifically, 
they are searching for universal, mathematical 
explanations. 

Physicists move away from settled ques- 
tions. In biology, much less seems settled. 
Emphasizing what you knowis less interesting 
than saying what you need to learn. 

Many of the missing pieces in biology are 
quantitative details, suchas the absolute copy 
numbers ofa protein or anRNA inasingle cell 
that kinetically mediates some function, and 
how the cell keeps track of so many regulatory 
dials. Do cells perform regular maintenance 
ona parts-replacement calendar, as is done 
for aeroplanes, or is damage the only trigger 
for replacing parts? Does damage tend to 
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The fastest way to 
make a physicist’s eyes 
glaze over is to recite 
biological jargon.” 


occur according to a clock? Identifying these 
unknowns in biology is more stimulating than 
recounting textbook-level information. 


Recognize the posture of false modesty. I have 
found that physicists often display a false mod- 
esty regarding their knowledge of biology. A 
physicist will say biology is much more difficult 
than physics because there is alot to memorize. 
They might say: you know an encyclopaedia of 
detail, and alll know are a few equations. 


Keepin mind the maths shortfall in biology. 
For most biological phenomena, we don’t have 
precise equations — unlike in physics. This is 
not to say that we don’t have maths, but our 
field needs alot more detailed quantification. 
This lack is the Achilles heel of biology, and 
affects even the concepts we use every day. 

It is a rare biologist who can explain the 
negative binomial used in RNA sequencing, 
parameterize organelle shape and distance 
scales when assessing active transport and dif- 
fusion, or interpret the phase diagrams that 
correspond to biomolecular condensates —a 
surging interest in biology. 
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Deep computational knowledge is a huge 
asset for biologists, but many have not had 
the opportunity to hone these skills or even be 
introduced to them. In my experience, open- 
ing aconversation and sharing students with 
physical scientists is step one towards bridging 
this gap. 


Don’t be flummoxed by physicists’ maths. 
In discussing their own work, physicists will 
often reach for a formula. After they write 
the equation and stare at it as if pondering a 
Mark Rothko painting, they might proffer an 
explanation. You can have a productive con- 
versation without the bread-and-butter basics 
of physics, such as Hamiltonians or modelling 
an Ising phase transition. Bear in mind that 
physicists can be more sensitive about the 
maths than are mathematicians, who are not 
bound by reality and see the world as a con- 
straint. 

Albert Einstein said: “As far as the laws of 
mathematics refer to reality, they are not 
certain; and as far as they are certain, they do 
not refer to reality.” The biologist must have 
substantial comfort with uncertainty. Com- 
pared with the good predictions from theory 
in physics, prediction in vast facets of biology, 
such as evolution, is more like stock-market 
cycles — discernible only in retrospect. 


Scale matters. Consider this matter of 
perspective: most of the time, what a physi- 
cist does is much smaller or much bigger, or 
much colder or much hotter, than anything 
we biologists do. 


SONIA FERNANDEZ 


Biologists should know the sizes of 
macromolecules — the organelles and the 
cells that they work with — and first consider 
how they scale (or do not scale) with systems 
derived from physics. 


Consider precision. Physicists require much 
greater certitude to draw a conclusion. They 
might be a bit smug when you claim signifi- 
cance witha Pvalue of less than 0.05. For them, 
attaining a result with a P value of 0.05 is like 
the chance of hitting a barn door with a base- 
ball: you can’t miss it. 


Avoid jargon. The fastest way to make a phys- 
icist’s eyes glaze over is to recite biological 
jargon, suchas gene lists and pathways. Come 
to think of it, it’s the fastest way to make any- 
one’s eyes glaze over. 


Skip some details. Physicists do not want to 
know about all your controls. They want the 
concepts, and are willing to assume you have 
performed the appropriate control experi- 
ments and have chosen the correct methods. If 
you're doing a ‘chalk talk’ (illustrated presenta- 
tion) for physicists, be sure to emphasize your 
ideas more than detailed data and techniques. 
Physicists make judgements on the basis of 
clear, cogent, compelling ideas. 


Manage expectations. Physicists expect you 
to build your own equipment. If this topic 
comes up with a theorist, feel free to ignore it. 
If it comes up with an experimentalist, you will 
have to concede that you paid a huge mark-up 
for off-the-shelf equipment. 


Understand optimization versus the ‘good- 
enough’ principle. Biological processes carry 
evolutionary baggage, and therefore arrive 
at solutions that might not be optimal. If, 
for example, you were designing a machine 
that walks, it might not be best to model it 
on human walking, which is not necessarily 
the most efficient way to get somewhere. In 
the absence of evolution, the physical world 
is often assumed to operate according to 
optimized parameters. By contrast, biology 
operates in what has been called a Wadding- 
ton terrain: processes function well even when 
they are not as efficient as possible — as long 
as they are still good enough to work. 


Consider a physicist’s perspective on reduc- 
tionism. Reductionist approaches to complex 
biological systems do not necessarily mean 
reducing those systems to small units. Systems 
approaches to biology that involve large num- 
bers of variables can be inherently interest- 
ing to physicists because they conceptually 
resemble problems in statistical mechanics 
that relate microscopic and macroscopic prop- 
erties with very large degrees of freedom — and 
exact solutions are not possible. If a biologist 


can frame a problem in these terms, it will 
probably engage a physicist. 


Physicists laugh alot. Not only is the humour 
of physicists arcane, but almost anything 
unexpected can provide a jocular moment. 
Theirs are the ultimate inside jokes, which are 
often not obviously funny. But laugh along 
anyway — even if you don’t find the humour, 
they won't know the difference. 


The future 


To conclude, let me say this path is not for 
everyone. Knowing yourself is the crucial 


element. If you are uncomfortable engaging 
with specialists in fields in which you are not 
trained, or if you don’t want to delve into topics 
that might not immediately bear a clear con- 
nection to your established domain, then the 
course | took is not for you. However, if you are 
not only comfortable with this approach, but 
require stimulation from outside your imme- 
diate realm of investigation, there is nothing 
better than to look in unexpected places. 


Kenneth S. Kosik is a professor of 
neuroscience at the University of California, 
Santa Barbara. 


A PHYSICIST TALKS 
TO ABIOLOGIST 


The fastest way to understand a new discipline is 
toembed yourself in it. By Sarah Bohndiek 


t’s amazing how many of biologist Ken 

Kosik’s points in his accompanying careers 

column (page 281) resonate with my expe- 

rience as a physicist working in the oppo- 

site direction. I enjoyed my training in 
physics largely because it didn’t involve rote 
learning (needed by biologists to get that 
encyclopaedic knowledge) and it allowed me 
to derive much of the material from a few key 
equations and principles. 

In his column, Ken notes that when a phys- 
icist says they do not understand something 
in biology, they are not requesting a lengthy 
‘biology 101’ explanation. But |do sometimes 
want a biology 101 refresher so I can gather 
a deeper understanding of the biological 
problem while I think about howto address it. 

Ken also talks about being uncomfortable 
with uncertainty. Personally, I can be quite 
comfortable with uncertainty when! can take 
steps to control it; it’s just that, in biological 
systems, Inormally have to accept thatI can’t. 


A deeper understanding 


The PhD I received from University College 
London in 2008 was in radiation physics, with 
a focus on evaluating the potential of CMOS 
image sensors (the silicon behind most smart- 
phone cameras) for application in medical 
X-ray-diffraction studies. 

Most of this research took place in a dark 
room, where I examined the optical response 
of the image sensors, but because I was ina 
department of medical physics and bioengi- 
neering, | was exposed to a range of biomedical 
challenges — including cancer detection. 
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AsI progressed through my studies, I realized 
that I wanted to gain a deeper understanding 
of cancer itself and how we might exploit its 
biology so we could use non-invasive imaging 
to find the disease. 

Ihadn’t studied biology since I was 16, so 
I started looking for postdoctoral positions 
in ‘friendly’ environments — biophysics, bio- 
engineering andso on. But! soon realized that, 
to truly understand cancer, I needed to break 
out of my comfort zone and immerse myself 
in that environment. 

My first postdoc was in Kevin Brindle’s 
biochemistry laboratory at the University of 
Cambridge, UK. I learnt howto hold a pipette, 
run a magnetic resonance imaging system, 
conduct animal experiments and design biol- 
ogy studies. 

In return, I fixed temperamental magnets, 
wrote hardware-control code and super- 
vised some of the lab’s more physical-scienc- 
es-oriented projects. After three years, | had 
a much greater understanding of cancer bio- 
chemistry and howto conduct in vivo imaging 
studies, but! was missing exposure to clinical 
applications and connection to my previous 
research home in optics. 

My second postdoc, with Sam Gambhir in 
the department of radiology at Stanford Uni- 
versity, California, took me back to my roots. | 
explored new projects in optics, trying to make 
imaging with optics faster and cheaper than 
before and opening newhorizons in fields such 
as endoscopy. 

As part of Stanford’s Molecular Imaging Pro- 
gram,Isawteams build newtechnologiesinthe 
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lab and take them through to first-in-human 
studies. | was inspired by Sam’s dedication 
to revolutionizing early cancer detection 
and his incredible passion for transforming 
technology into clinical trials. That passion 
still drives me in my current role at the Uni- 
versity of Cambridge, UK, where the mission 
of my interdisciplinary team is to advance our 
understanding of tumour evolution using 
next-generation-imaging sciences. We oper- 
ate between the department of physics and 
the Cancer Research UK Cambridge Institute, 
and this gives us access to a community of 
optical physicists as well as cancer biologists, 
and atranslational pipeline through to clinical 
application. 


Build synergy 


I’matechnology geek at heart. I love to diveinto 
the underlying physics when! design optics 
and to explore the engineering of new devices; 
but, for me, the story is never complete without 
seeing these methods through to application. 
I feel that the research path! explore todayisa 
synergy of all my experiences. 

lam often asked what I learnt as I traversed 
this path from physics to medicine. The fol- 
lowing are the key steps I had to take during 
my own personal development, which pre- 
pared mewellto embrace my interdisciplinary 
career. 


Learn the language. Just as with a foreign 
language, the fastest way to become fluent in 
the language of anewscientific discipline is to 
embed yourself inside it and practise regularly. 
For me, learning the language was the most 
important reason why I moved from physics 
to biochemistry for my first postdoc. 


Get comfortable being uncomfortable. | went 
from being one of the most knowledgeable 
people in my research field at the end of my 
PhD to knowing less than most of the first-year 
PhD students in my new lab. For me, that meant 
Iwas doing something right, but you do have to 
be OK with taking that hit and spending time 
building a new skill set. 


Don’t forget that you bring unique skills. 
I had a lot to learn along my journey, but I 
brought useful skills at each stage. My quan- 
titative mindset was valuable in designing 
imaging experiments, my engineering skills 
were useful for fixing broken equipment and 
my bottom-up physics approachto problems 
offered a fresh perspective. 


Test biological-research hypotheses. Some- 
times you'll bring a quantitative perspective 
that enables you to test a hypothesis or embark 
on a research topic that wasn’t accessible 
before. Don’t be afraid to ask the biological 
question yourself rather than waiting for your 
collaborators to provide an answer. 
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Astrong element of 
academic achievement 
is community 
recognition and 
networking.” 


Ask questions. I had to, and there are no stu- 
pid ones. | also attended introductory lectures 
for first-year undergraduates to fill in gaps 
in my knowledge. Going back to school and 
admitting what you do not knowis crucial. 


Embrace uncertainty. I continue to be amazed 
by the complexity of biology and hence the 
uncertainty in results. Unfortunately, it’s likely 
that I’ll never be as certain about anything in 
biology as I can be about new discoveries in 
physics. 


Learn statistics. Having a quantitative back- 
ground doesn’t necessarily mean you really 
understand statistics. | had to dive back 
into statistics to design the best biological 
experiments. 


Don’t lose touch with your roots. Being able 
to continually innovate at the interface of 
physics and biology means staying grounded 
in the physical sciences, but this can be hard to 
dowhen youare ina biological department. To 
compete for faculty positions, you need to be 
able to teach your subject, so find ways to keep 
it fresh in your mind, as I did, for example, 
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by teaching physics in undergraduate or 
graduate courses. 


Do not blindly accept dogma. Challenging 
prevailing ideas in biology using your per- 
spective can bring about revolutions. I greatly 
admire colleagues who have upturned dec- 
ades of accepted dogma using quantitative 
methods that were not even considered by the 
biological community. 


Perfect your pitch. Working across disciplines 
means you'll inevitably be talking at confer- 
ences and meetings to groups of scientific 
non-specialists; the best communicators 
perfect a balance of generalist and special- 
ist material in presentations and make their 
language accessible to all. 


Avoid equations during presentations. It’s 
not just biologists who would prefer not to 
see them. Illustrate equations with graphs and 
visuals as much as possible. If you do have to 
include equations, write them ona board in 
real time so your audience can follow your 
guided explanation — don’t just flash them 
up on the screen. 


Find agood mentor. Finally, and most impor- 
tantly, | would not be wherel am today without 
my incredible mentors. When you move disci- 
plines, you need people who can help you to 
integrate into anewcommunity, show you how 
tonavigate anew funding landscape and advise 
on expectations for junior faculty members, 
when the emphasis on publication and impact 
varies dramatically between disciplines. 


The future 


More unified training, encompassing physics 
and engineering applied to biology and medi- 
cine, will help to equip the next generation of 
researchers with the skills they need to operate 
across these disciplinary boundaries. 

Istill advocate for specialist training in one 
field as an undergraduate and then broadening 
research horizons gradually through advanced 
academic training. 

Astrong element of academic achievement 
is community recognition and networking; 
moving too rapidly between fields can make 
it challenging to build a reputation for excel- 
lence. Imaging is kind in this respect: regard- 
less of how you form animage, there are many 
commonalities in the research that mean it is 
relatively easy to tell a story within your career 
and maintain connections along that journey. 

Building that story and your profile will 
ultimately be crucial, no matter what path 
you choose to take. 


Sarah Bohndiek leads an interdisciplinary 
team combining physicists, engineers, 
chemists and molecular biologists at the 
University of Cambridge, UK. 
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usually apologize for the mess when 
people enter my office. There are dozens 
of periodic tables, hundreds of plastic 
bottles, pictures all over the walls, books 
going up to the ceiling and various old 
bits of apparatus that people wanted to 
throw out. 

I’magreen chemist, interested in 
processes that generate as few hazardous 
by-products as possible. I’m part of a big 
project called Photo-Electro, funded by 
Britain’s Engineering and Physical Sciences 
Research Council, that aims to minimize the 
amount of chemicals and solvents, and 
the number of processing steps, needed to 
construct complex molecules. For example, 
we study ways to use light and electricity 
to produce chemicals more efficiently. 

But I’m perhaps better knownasa 
popularizer of the periodic table. | have 
all sorts of objects inspired by it in my 
office: a large collection of periodic-table 
neckties, anow-empty box of periodic- 
table chocolates, and even a periodic table 
engraved ona strand of my own hair. 

lalso have a lot of toys that I play with: 
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a dog toy that looks like carbon-60, various 
Rubik’s cubes — I can’t do Rubik’s cubes, but 
they’re fun to fiddle with — and silly things 
like a plastic frog that, when you squeeze it, 
produces fake spawn. 

A big challenge for all scientists is to come 
up with new ideas. Workshops on creativity 
usually have toys lying around. Nobody 
is suggesting that a toy frog is necessarily 
going to inspire us to, say, use frog slime to 
produce a reaction. But I think that having 
lots of pictures, lots of toys hanging around 
is agood way of relaxing the mind and 
fostering innovation. 

As for the plastic bottles, I had along- 
running project trying to make the plastic in 
a greener way. We succeeded, but ona small 
scale, and the company decided it wasn’t 
worth scaling up. Nonetheless, I learnt a 
lot, and the bottles are a good conversation 
starter. As it turns out, quite alot of people 
enjoy being in my office. 


Martyn Poliakoff is a green chemist at the 
University of Nottingham, UK. Interview by 
Amber Dance. 


